IMMUNIZATION FOR EBOLA VIRUS INFECTION



FIELD OF THE INVENTION 

The present invention relates generally to viral vaccines and, more particularly, 
to Ebola virus vaccines and methods of protecting against disease caused by infection 
with Ebola virus.

BACKGROUND OF THE INVENTION 

The Ebola viruses, and the genetically-related Marburg virus, are filoviruses
associated with outbreaks of highly lethal hemorrhagic fever in humans and primates
in North America, Europe, and Africa. Peters, C.J. et al., Filoviridae: Marburg and
Ebola Viruses, in Fields Virology, (eds., Fields, B.N., Knipe, D.M. & Howley, P.M.)
1161-1176 (Philadelphia, Lippincott-Raven, 1996); Peters, C.J. et al., Semin. Virol.
5:147-154 (1994). Ebola viruses are negative-stranded RNA viruses comprised of
four subtypes, including those described in the Zaire, Sudan, Reston, and Ivory Coast
episodes. Sanchez, A. et al., PNAS (USA) 93:3602-3607 (1996). Although several
subtypes have been defined, the genetic organization of these viruses is similar, each
containing seven linearly arrayed genes. Among the viral proteins, the envelope
glycoprotein exists in two alternative forms, a 50-70 kilodalton (kDa) secreted protein
of unknown function encoded by the viral genome and a 130 kDa transmembrane
glycoprotein generated by RNA editing that mediates viral entry. Peters, C.J. et al.,
Filoviridae: Marburg and Ebola Viruses, in Fields Virology, (eds., Fields, B.N., Knipe,
D.M. & Howley, P.M.) 1161-1176 (Philadelphia, Lippincott-Raven, 1996); Sanchez, A.
et al., PNAS (USA) 93:3602-3607 (1996). Other structural gene products include the
nucleoprotein (NP), matrix proteins VP24 and VP40, presumed nonstructural proteins
VP30 and VP35, and the viral polymerase (reviewed in Peters, C.J. et al., Filoviridae:
Marburg and Ebola Viruses, in Fields Virology, (eds., Fields, B.N., Knipe, D.M. &
Howley, P.M.) 1161-1176 (Philadelphia, Lippincott-Raven, 1996)). Although
spontaneous variation of its RNA sequence does occur in nature, there appears to be
less nucleotide polymorphism within Ebola subtypes than among other RNA viruses
(Sanchez, A. et al., PNAS (USA) 93:3602-3607 (1996)), suggesting that immunization
may be useful in protecting against this disease. Previous attempts to elicit protective
immune responses against Ebola virus using traditional active and passive
immunization approaches have, however, not succeeded. Peters, C.J. et al.,
Filoviridae: Marburg and Ebola Viruses, in Fields Virology, (eds., Fields, B.N., Knipe,
D.M. & Howley, P.M.) 1161-1176 (Philadelphia, Lippincott-Raven, 1996); Clegg, J.C.S.
et al., New Generation Vaccines, (eds., Levine, M.M., Woodrow, G.C., Kaper, J.B. &
Cobon, G.S.) 749-765 (New York, NY, Marcel Dekker, Inc. 1997); Jahrling, P.B. et al.,
Arch. Virol. Suppl. 11:135-140 (1996).

It would thus be desirable to provide a vaccine to protect against disease 
caused by infection with Ebola virus. It would further be desirable to provide methods
of making and using said vaccine. 

SUMMARY OF THE INVENTION 
Ebola virus vaccines comprising nucleic acid molecules encoding Ebola viral 
proteins are provided. In one embodiment, the nucleic acid molecule encodes the 
transmembrane form of the viral glycoprotein (GP). In another embodiment, the
nucleic acid molecule encodes the secreted form of the viral glycoprotein (sGP). In 
yet another embodiment, the nucleic acid molecule encodes the viral nucleoprotein 
(NP). 

The present invention also provides methods for immunizing a subject against
disease caused by infection with Ebola virus comprising administering to the subject
an immunoeffective amount of an Ebola virus vaccine. Administration can be by any
of the routes normally used for gene therapy. In a preferred method, the Ebola virus
vaccine is administered by intramuscular injection. The genetic immunization
methods of the present invention provide protective immunity against disease caused
by infection with Ebola virus.

Additional objects, advantages, and features of the present invention will 
become apparent from the following description and appended claims, taken in 
conjunction with the accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The various advantages of the present invention will become apparent to one
skilled in the art by reading the following specification and subjoined claims and by
referencing the following drawings.

Figures 1A and 1B are photographs showing expression of Ebola virus gene
products in eukaryotic plasmid expression vectors. 

Figure 1A. Expression vectors encoding the indicated viral gene products
under regulation of the CMV immediate-early region 1 enhancer and promoter were
prepared and transfected into 293 cells as previously described. Manthorpe, M. et
al., Hum. Gene Ther. 4:419-431 (1993); Sambrook, J., Fritsch, E.F. & Maniatis, T.,
Cold Spring Harbor, N.Y., Cold Spring Harbor Laboratory Press, 1994. Cell extracts
were prepared and analyzed by Western blot analysis for NP (left) or GP (right) using



WO 99/32147



PCT/US98/27364 




relevant rabbit antisera and a secondary antibody, horseradish peroxidase-conjugated
donkey anti-rabbit IgG, at a dilution of 1:5,000. Incubation was for 30 minutes at
room temperature with the primary antibody, and for 30 minutes at room temperature
with the secondary antibody. Immunocomplexes were then detected by chemiluminescence
using SuperSignal substrate reagents (Pierce) according to the manufacturer's
instructions.

Figure 1B. Generation of antibody response in mice immunized with the
indicated vectors and analyzed by Western blot for NP, GP, and sGP as shown.
Antisera from mice were tested at a dilution of 1:500 (NP), 1:50 (GP), or 1:50 (sGP),
respectively, and developed with a secondary antibody (sheep anti-mouse, 1:5,000,
Amersham Life Science) and chemiluminescence as in Figure 1A. The control vector
used for immunization represents the expression vector plasmid with no insert.
Manthorpe, M. et al., Hum. Gene Ther. 4:419-431 (1993).

Figures 2A-2D are graphs showing the immune responses to NP and GP after
genetic immunization in mice.

Figure 2A. Splenic lymphocytes from vector or NP-plasmid immunized mice
were isolated approximately 6 weeks after the initial immunization and sensitized in
vitro for 5 days with 10 U/ml hIL-2. Renca-NP cell-sensitized splenocytes from
vector-immunized or pCMV-NP immunized mice were used to detect CTL activity at
the indicated effector:target ratios on Renca or Renca-NP cells (left, middle).
Allogeneic effector cells were tested on Renca-NP cells to show that these targets
are susceptible to lysis (right). Allogeneic effector cells were generated by
incubating cells derived from mice with a C57Bl/6 background (5x10^/ml) with
irradiated Balb/c spleen cells (5x10^/ml) in the presence of IL-2 (20 U/ml) for five
days. The chromium release CTL assay with Renca-NP cells was performed in
triplicate as previously described. Ohno, T. et al., Gene Ther. 4:361-366 (1997).
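
The specification does not state how chromium release counts were converted to
lytic activity. As a minimal sketch, assuming the conventional calculation in which
percent specific lysis is the experimental release above spontaneous release,
expressed as a fraction of the maximum releasable label, triplicate wells could be
reduced as follows. The function name and the counts in the usage lines are
hypothetical and are not data from this example.

```python
from statistics import mean


def percent_specific_lysis(experimental_cpm, spontaneous_cpm, maximum_cpm):
    """Percent specific lysis from replicate chromium release counts.

    Assumption (not stated in the source): the conventional formula
        100 * (experimental - spontaneous) / (maximum - spontaneous)
    applied to the mean counts per minute (cpm) of each set of replicate wells.
    """
    experimental = mean(experimental_cpm)
    spontaneous = mean(spontaneous_cpm)
    maximum = mean(maximum_cpm)
    if maximum <= spontaneous:
        raise ValueError("maximum release must exceed spontaneous release")
    return 100.0 * (experimental - spontaneous) / (maximum - spontaneous)


if __name__ == "__main__":
    # Hypothetical triplicate counts for one effector:target ratio.
    lysis = percent_specific_lysis(
        experimental_cpm=[1500, 1600, 1400],
        spontaneous_cpm=[500, 500, 500],
        maximum_cpm=[2500, 2500, 2500],
    )
    print(f"specific lysis: {lysis:.1f}%")
```

Repeating the calculation at each effector:target ratio would give the kind of
titration curve plotted in Figures 2A-2C.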

Figure 2B. Balb/C female mice were immunized with the sGP plasmid
expression vector and analyzed for their ability to lyse the syngeneic Renca cell line
stably expressing GP. Isolation of stable transfectants, confirmation of expression,
and CTL assay were performed as described (see Specific Example, II. Methods).
Renca-GP or sGP sensitized splenocytes from pCMV-GP or pCMV-sGP immunized
mice were used to determine the specific killing of chromium labeled Renca-GP cells
at the indicated E/T ratios.

Figure 2C. Mice immunized with GP were analyzed for their ability to lyse a
syngeneic CT26 cell line stably expressing GP or a vector control-transduced CT26
line at the indicated E/T ratios.

Figure 2D. Cellular proliferative response in the indicated immunized mice.
T cells, enriched or depleted (see Specific Example, II. Methods), were incubated at
10^ cells/ml with sGP-conditioned medium (25%). Background was determined with cells
incubated in medium from control transfected 293 cells and subtracted from proliferation
seen in sGP-containing supernatants.

Figures 3A-3C are graphs showing that immunization with sGP or GP expression
plasmids induces T cell responses to sGP in guinea pigs.

Figures 3A-3C. Cell-mediated immunity in guinea pigs was analyzed by
performing assays to detect cell proliferation to control or GP antigen (A) or T-cell
growth factor production in response to the indicated antigens (B). The culture
supernatants containing these antigens were prepared as previously described
(Bottomly, K. et al., Measurement of human and murine interleukin 2 and interleukin
4, in Current Protocols in Immunology, (eds., Coligan, J.E., Kruisbeek, A.M.,
Margulies, D.H., Shevach, E.M. & Strober, W.) 6.3.1-6.3.12 (New York, John Wiley &
Sons, Inc. 1992); Arai, H. et al., Nat. Med. 3:843-848 (1997)), and included at a final
concentration of 10% (volume/volume). In A, cell numbers refer to the concentration
of spleen cells per ml in the 3H-thymidine proliferation assay. In B, supernatants from
A, harvested at the time of the peak proliferative response to sGP, were incubated
with primary guinea pig T cells maintained in 200 U/ml of human IL-2. The percent
maximal response refers to the magnitude of stimulation in response to the indicated
stimuli relative to supernatants from 24 hour concanavalin A-stimulated cells (2
µg/ml). The requirement of T lymphocytes in guinea pig spleen cells for the
proliferative response to sGP, performed as described in Specific Example, II.
Methods, is shown (C).

Figures 4A-4F are photographs showing the immunohistochemical analysis of
Ebola virus antigens in liver, lung, and spleen from representative protected
(GP-animal 3) or infected (vector-animal 2) guinea pigs.

Figures 4A-4F. Magnification: liver, 40x; lung, 20x; spleen, 20x.

Figure 5 is a schematic of the plasmid pVR 1012-GP(IC) (Ivory Coast strain
of GP, SEQ ID NO: 1).

Figure 6 is a schematic of the plasmid pVR 1012-GP(S) (Sudan strain of GP,
see SEQ ID NO: 2).

Figure 7 is a schematic of the plasmid pVR 1012-GP(Z) (Zaire strain of GP,
see SEQ ID NO: 3).

Figure 8 is a schematic of the plasmid pVR 1012-sGP(Z) (Zaire strain of sGP,
see SEQ ID NO: 4).

Figure 9 is a schematic of the plasmid pVR 1012-NP.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Ebola virus vaccines are provided comprising a nucleic acid molecule encoding 
an Ebola viral protein operatively-linked to a control sequence in a pharmaceutically 
acceptable carrier. In one embodiment, the nucleic acid molecule encodes the 
transmembrane form of the viral glycoprotein (GP). In another embodiment, the
nucleic acid molecule encodes the secreted form of the viral glycoprotein (sGP). In 
yet another embodiment, the nucleic acid molecule encodes the viral nucleoprotein 
(NP). 

The present invention further includes vaccines comprising nucleic acid
molecules encoding Ebola viral proteins other than GP, sGP, and NP, e.g., other
structural gene products which elicit protective immunity from disease caused by
infection with Ebola virus. The nucleic acid molecules of the vaccines of the present
invention encode structural gene products of any Ebola viral strain including the Zaire,
Sudan, Ivory Coast and Reston strains. Nucleic acid molecules encoding structural
gene products of the genetically-related Marburg virus strains may also be employed.
Moreover, the nucleic acid molecules of the present invention may be modified, e.g.,
the nucleic acid molecules set forth herein may be mutated, as long as the modified
expressed protein elicits protective immunity from disease caused by infection with
Ebola virus. For example, the nucleic acid molecule may be mutated so that the
expressed protein is less toxic to cells. The present invention also includes vaccines
comprising a combination of nucleic acid molecules. For example, and without
limitation, nucleic acid molecules encoding GP, sGP and NP of the Zaire, Sudan and
Ivory Coast Ebola strains may be combined in any combination, in one vaccine
composition.

The present invention also provides methods for immunizing a subject against
disease caused by infection with Ebola virus comprising administering to the subject
an immunoeffective amount of an Ebola virus vaccine. Methods of making and using
Ebola virus vaccines are also provided by the present invention including the
preparation of pharmaceutical compositions.

As referred to herein, the term "encoding" is intended to mean that the subject
nucleic acid may be transcribed in a cell, e.g., when the subject nucleic acid is linked
to appropriate control sequences such as a promoter in a suitable vector (e.g., an
expression vector) and the vector is introduced into a cell. The nucleic acid
molecules of the present invention may be DNA molecules, cDNA molecules or RNA
molecules, and are preferably cDNA molecules. The term "operatively-linked" as
used herein refers to functional linkage between a nucleic acid expression control
sequence (such as a promoter) and a second nucleic acid sequence, wherein the
expression control sequence directs transcription of the nucleic acid corresponding
to the second sequence. Expression control sequences are known to those skilled
in the art (see, e.g., Goeddel, Gene Expression Technology: Methods in Enzymology
185, Academic Press, San Diego, CA (1990)). Vectors which contain both a promoter
and a cloning site, such that an inserted piece of nucleic acid is operatively-linked to
the promoter, are well known in the art and are generally referred to herein as
"expression vectors" or "expression vector plasmids". Preferably, these vectors are
capable of transcribing nucleic acid in vitro and in vivo. A preferred vector is the
cytomegalovirus (CMV) expression vector which directs high levels of gene expression
in muscle.

Nucleic acid molecules which hybridize under stringent conditions to the
nucleic acid molecules described herein are also within the scope of the present
invention. As will be appreciated by those skilled in the art, multiple factors are
considered in determining the stringency of hybridization including species of nucleic
acid, length of nucleic acid probe, Tm (melting temperature), temperature of
hybridization and washes, salt concentration in the hybridization and wash buffers,
aqueous or formamide hybridization buffer, and length of time for hybridization and
for washes. An example of stringent conditions is DNA-DNA hybridization with a
probe greater than 200 nucleotides in 5 x SSC, at 65°C in aqueous solution or 42°C
in formamide, followed by washing with 0.1 x SSC, at 65°C in aqueous solution.
(Other experimental conditions for controlling stringency are described in Maniatis, T.
et al., Molecular Cloning: a Laboratory Manual, Cold Spring Harbor Laboratory, Cold
Spring Harbor, N.Y. (1982) at pages 387-389 and Sambrook, J. et al., Molecular Cloning:
a Laboratory Manual, Second Edition, Volume 2, Cold Spring Harbor Laboratory,
Cold Spring Harbor, N.Y., at pages 8.46-8.47 (1989)).
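
To illustrate how salt, formamide, and probe length interact in setting stringency,
the following is a rough sketch rather than a method taken from this specification.
It assumes one commonly quoted empirical approximation for the melting temperature
of long DNA-DNA hybrids; published coefficients differ slightly between sources, and
the approximation is generally regarded as most reliable at moderate sodium
concentrations, so the 5 x SSC figure is an extrapolation. It also assumes that
1 x SSC is 0.15 M NaCl plus 0.015 M trisodium citrate. The 45% G+C content, the
500 nucleotide probe, and the 50% formamide concentration are hypothetical
placeholders.

```python
import math

# Assumption: 1 x SSC = 0.15 M NaCl + 0.015 M trisodium citrate,
# i.e. about 0.195 M sodium ions in total.
SODIUM_MOLAR_PER_1X_SSC = 0.15 + 3 * 0.015


def estimate_tm_celsius(ssc_fold, gc_percent, length_nt, formamide_percent=0.0):
    """Approximate Tm (degrees C) of a long DNA-DNA hybrid.

    Assumed empirical approximation (coefficients vary between sources):
        Tm = 81.5 + 16.6*log10[Na+] + 0.41*(%G+C)
             - 0.63*(%formamide) - 600/length
    """
    sodium_molar = ssc_fold * SODIUM_MOLAR_PER_1X_SSC
    return (
        81.5
        + 16.6 * math.log10(sodium_molar)
        + 0.41 * gc_percent
        - 0.63 * formamide_percent
        - 600.0 / length_nt
    )


if __name__ == "__main__":
    gc, length = 45.0, 500  # hypothetical probe properties
    conditions = [
        # (label, SSC fold, percent formamide, incubation temperature in C)
        ("hybridization, aqueous", 5.0, 0.0, 65.0),
        ("hybridization, formamide", 5.0, 50.0, 42.0),
        ("wash", 0.1, 0.0, 65.0),
    ]
    for label, ssc, formamide, temp in conditions:
        tm = estimate_tm_celsius(ssc, gc, length, formamide)
        # A smaller margin below Tm means a more stringent step.
        print(f"{label}: Tm ~ {tm:.1f} C, margin below Tm ~ {tm - temp:.1f} C")
```

Under these assumptions the low-salt wash sits much closer to the estimated Tm than
either hybridization step, which is why the wash is the step that removes imperfectly
matched hybrids.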

It will be appreciated that administration of the vaccines of the present
invention can be by any of the routes normally used for gene therapy. In a preferred
method, administration is by intramuscular injection; however, other procedures for
transfecting cells may also be employed, such as transfection using calcium
phosphate coprecipitation, liposome-mediated transfection, plasmid and viral
vector-mediated transfection and DNA protein complex-mediated transfection. Viral
vector-mediated transfection includes, without limitation, the use of retroviral,
replication-deficient retroviral, adenoviral and adeno-associated viral vectors. Cells
transfected by the vaccines in the context of ex vivo gene therapy can also be
administered.

It will be appreciated that more than one route of administering the vaccines
of the present invention may be employed either simultaneously or sequentially (e.g.,
boosting). In addition, the vaccines of the present invention may be employed in
combination with traditional immunization approaches such as employing protein
antigens, vaccinia virus and inactivated virus, as vaccines. Thus, in one embodiment,
the vaccines of the present invention are administered to a subject (the subject is
"primed" with a vaccine of the present invention) and then a traditional vaccine is
administered (the subject is "boosted" with a traditional vaccine). In another
embodiment, a traditional vaccine is first administered to the subject followed by
administration of a vaccine of the present invention. In yet another embodiment, a
traditional vaccine and a vaccine of the present invention are co-administered.

Immunogenicity may be significantly improved if the vaccines of the present
invention are co-administered with an immunostimulatory agent or adjuvant.
Adjuvants enhance immunogenicity but are not necessarily immunogenic themselves.
Immunostimulatory agents or adjuvants have been used for many years to improve
the host immune responses to, for example, vaccines. Adjuvants may thus be
employed to enhance the immunogenicity of the vaccines of the present invention, as
well as the immunogenicity of traditional vaccines. Suitable adjuvants are well known
to those skilled in the art and include, without limitation, aluminum phosphate,
aluminum hydroxide, QS21, Quil A, derivatives and components thereof, calcium
phosphate, calcium hydroxide, zinc hydroxide, a glycolipid analog, an octadecyl ester
of an amino acid, a muramyl dipeptide, polyphosphazene, a lipoprotein, ISCOM
matrix, DC-Chol, DDA, and other adjuvants and bacterial toxins, components and
derivatives thereof.

The vaccines of the present invention may also be co-administered with
cytokines to further enhance immunogenicity. The cytokines may be administered by
methods known to those skilled in the art, e.g., as a nucleic acid molecule in plasmid
form or as a protein or fusion protein.

Upon inoculation with a pharmaceutical composition as described herein, the
immune system of the host responds to the vaccine by producing antibodies, both
secretory and serum, specific for Ebola virus proteins. As a result of the vaccination,
the host becomes at least partially or completely immune to Ebola virus infection, or
resistant to developing moderate or severe disease caused by Ebola virus infection.
Although Ebola virus infection and disease caused thereby are discussed in detail
herein, it will be appreciated that the vaccines and methods of the present invention
may be employed to immunize a subject against hemorrhagic fever generally, such
as that caused by infection by the genetically-related Marburg virus.

Pharmaceutical compositions comprising the nucleic acid molecules encoding
Ebola viral proteins described herein, either alone or in combination, and a
pharmaceutically acceptable carrier, are also provided by the present invention. As
used herein, the phrase "pharmaceutically acceptable carrier" encompasses any of
the standard pharmaceutical carriers, such as those suitable for parenteral
administration, for example, by intramuscular, intraarticular (in the joints),
intravenous, intradermal, intraperitoneal, and subcutaneous routes. Examples of such
formulations include aqueous and non-aqueous, isotonic sterile injection solutions,
which contain antioxidants, buffers, bacteriostats, and solutes that render the
formulation isotonic with the blood of the intended recipient, and aqueous and
non-aqueous sterile suspensions that can include suspending agents, solubilizers,
thickening agents, stabilizers, and preservatives.

Formulations suitable for oral administration can consist of (a) liquid solutions,
such as an effective amount of the vaccine dissolved in diluents, such as water, saline
or PEG 400; (b) capsules, sachets or tablets, each containing a predetermined
amount of the vaccine, as liquids, solids, granules or gelatin; (c) suspensions in an
appropriate liquid; (d) suitable emulsions; and (e) polysaccharide polymers such as
chitins. The vaccine, alone or in combination with other suitable components, may
also be made into aerosol formulations to be administered via inhalation, e.g., to the
bronchial passageways. Aerosol formulations can be placed into pressurized
acceptable propellants, such as dichlorodifluoromethane, propane, nitrogen, and the
like.

Suitable formulations for rectal administration include, for example,
suppositories, which consist of the vaccine with a suppository base. Suitable
suppository bases include natural or synthetic triglycerides or paraffin hydrocarbons.
In addition, it is also possible to use gelatin rectal capsules which consist of a
combination of the vaccine with a base, including, for example, liquid triglycerides,
polyethylene glycols, and paraffin hydrocarbons.

Each carrier must be "acceptable" in the sense of being compatible with the 
other ingredients of the formulation and not injurious to the recipient, e.g., the patient. 
The formulations can be presented in unit-dose or multi-dose sealed containers, such
as ampules or vials and may be prepared by any method known in the art. 

Pharmaceutical compositions comprising any of the nucleic acid molecules 
encoding Ebola viral proteins of the present invention are useful to immunize a 
subject against disease caused by Ebola virus infection. Thus, this invention further 
provides methods of immunizing a subject against disease caused by Ebola virus
infection, e.g., hemorrhagic fever, comprising administering to the subject an 
immunoeffective amount of a pharmaceutical composition of the invention. This 
subject may be an animal, for example a mammal, such as a primate or preferably 
a human. 

The vaccines of the present invention are also suitable for veterinary
immunization. The vaccines of the present invention comprising nucleic acid
molecules encoding Ebola virus structural gene products from the Reston strain, 
which is known to infect animals, are particularly useful in such veterinary 
immunization methods. 

The vaccines are administered in a manner compatible with the dosage
formulation, and in such amount as will be therapeutically effective, immunogenic and
protective. The quantity to be administered depends on the subject to be treated,
including, for example, the capacity of the immune system of the individual to
synthesize antibodies, and, if needed, to produce a cell-mediated immune response.
Precise amounts of active ingredient required to be administered depend on the
judgment of the practitioner and may be monitored on a patient-by-patient basis.
However, suitable dosage ranges are readily determinable by one skilled in the art
and generally range from about 300 µg to about 4-5 mg. The dosage may also
depend, without limitation, on the route of administration, the patient's state of health
and weight, and the nature of the formulation.

Methods of immunizing a subject against multiple strains of Ebola virus are
further provided herein. The nucleic acid molecules encoding Ebola viral proteins of
the present invention may be combined with nucleic acid molecules encoding other
viral proteins of other virus strains to achieve protection against multiple strains of
Ebola virus. Typically the vaccines will be in an admixture and administered
simultaneously, but may also be administered separately.

In some instances it may be desirable to combine the Ebola virus vaccines of
the present invention with vaccines which induce protective responses to other
agents, particularly other viruses. For example, the vaccine compositions of the
present invention can be administered simultaneously, separately or sequentially with
other genetic immunization vaccines such as those for influenza (Ulmer, J.B. et al.,
Science 259:1745-1749 (1993); Raz, E. et al., PNAS (USA) 91:9519-9523 (1994)),
malaria (Doolan, D.L. et al., J. Exp. Med. 183:1739-1746 (1996); Sedegah, M. et al.,
PNAS (USA) 91:9866-9870 (1994)), and tuberculosis (Tascon, R.C. et al., Nat. Med.
2:888-892 (1996)).

It will also be appreciated that single or multiple administrations of the vaccine 
compositions of the present invention may be carried out. For example, subjects who 
are particularly susceptible to Ebola virus infection may require multiple immunizations 
to establish and/or maintain protective immune responses. Levels of induced
immunity can be monitored by measuring amounts of neutralizing secretory and 
serum antibodies, and dosages adjusted or vaccinations repeated as necessary to 
maintain desired levels of protection. 

This invention also provides kits comprising the vaccines of the present 
invention. For example, kits comprising a vaccine and instructions for use are within
the scope of this invention. 

The vaccines and methods of the present invention evoke a protective immune 
response and do not lead to immunopotentiation or exacerbated disease. The 
vaccines lack transmissibility, are genetically stable and induce protective levels of 
humoral and cell-mediated immunity.

In order to more fully demonstrate the advantages arising from the present 
invention, the following example is set forth. It is to be understood that the following 
is by way of example only and is not intended as a limitation on the scope of the 
invention. 

SPECIFIC EXAMPLE

I. RESULTS

Immune response to viral gene products in mice. To characterize immune
responses to selected Ebola virus proteins, eukaryotic expression vector plasmids
were injected into mice. The cytomegalovirus (CMV) immediate early region 1
enhancer was used to stimulate transcription because it directs high levels of gene
expression in muscle. Manthorpe, M. et al., Hum. Gene Ther. 4:419-431 (1993).
cDNAs encoding an abundant structural protein, the major viral nucleocapsid
phosphoprotein (NP), the secreted glycoprotein (sGP), or the membrane-associated
glycoprotein (GP) were inserted. Alternative forms of GP were chosen because it had
been postulated that the transmembrane protein contained a protein sequence motif
also found in oncogenic retroviruses that might suppress immune responses.
Bukreyev, A.A. et al., FEBS Lett. 323:183-187 (1993); Cianciolo, G.J. et al., Science
230:453-455 (1985); Harris, D.T. et al., J. Immunol. 138:889-894 (1987); Volchkov,
V.E. et al., FEBS Lett. 305:181-184 (1992); Sanchez, A. et al., Virus Res.
29:215-240 (1993). Expression of the relevant proteins was confirmed after transfection of
the human renal epithelial cell line, 293, by immunoblotting with antisera to these
gene products (Fig. 1A). For NP, the expected full-length 104 kDa protein normally
produced by the virus was seen, together with lower molecular weight species likely
generated from truncated protein or degradation products described previously.
Sanchez, A. et al., Virology 170:81-91 (1989). Similarly, expression of sGP and GP
revealed a heterogeneous pattern whose sizes correlated with the expected products
of cleavage or post-translational carbohydrate modification. Sanchez, A. et al., PNAS
(USA) 93:3602-3607 (1996).

These plasmids were injected into mice to characterize their ability to induce
humoral and cellular immune responses to the relevant viral proteins. Three
injections, each with 50 µg of plasmid DNA in saline (100 µl), were performed at
two-week intervals in Balb/C female mice (6-8 week old, Charles River). Sera from
immunized recipients were examined for antibody responses. An antibody response
to the viral NP gene product was readily detectable (Fig. 1B), with titers of ^
1:16,000 by Western blot analysis. The titer of antibody induced in response to
injection with plasmids encoding the viral glycoproteins was lower. After immunization
with GP, no antibody was detectable by Western blotting, while immunization with
sGP induced an antibody response (Fig. 1B). The more sensitive ELISA (Ksiazek,
T.G., Lab. Anim. 20:34-46 (1991); Ksiazek, T.G. et al., J. Clin. Microbiol. 30:947-950
(1992)) did allow detection of antibodies to both GP and sGP at titers of 1:400 and
1:1,200, respectively. Cytolytic T cell (CTL) responses to these viral proteins were
analyzed next. Despite the substantial humoral immune response to NP, minimal CTL
activity was detected against syngeneic cells expressing this viral protein (Fig. 2A).
In contrast, genetic immunization with sGP, which elicited a weaker antibody
response, induced a marked cytolytic T cell response to cells expressing GP (Fig. 2B).
Immunization with the GP plasmid also induced a significant CTL response to GP
(Fig. 2C). These data suggested that both the secreted and transmembrane forms of
the protein could be processed for antigen presentation and the transmembrane form
was a target for recognition by these cytolytic T cells. Finally, antigen-specific T cell
proliferation to sGP was also observed in GP and sGP but not plasmid control
injected mice (Fig. 2D).

The ability of antibodies detected in mouse sera after immunization to
neutralize virus was tested in an in vitro infection assay. McCormick, J.B. et al., J.
Infect. Dis. 147:264-267 (1983). In no case was neutralization of infectivity observed,
even at titers of 1:10 (data not shown), despite the documented presence of antibody
after NP and sGP immunization by Western blot analysis. Infectivity in vitro was thus
not inhibited by Ebola-specific antibodies.

Immune function and viral challenge in guinea pigs. To determine whether
the in vivo immune responses could protect against viral infection, virus was adapted
to grow in guinea pigs. Though this species is not well-suited to analysis of immune
function, infection in adult mice has not been successful. Moreover, infection in
guinea pigs, used originally to propagate virus from infected humans, is a
well-established animal model for the human disease. Infection gives rise to a syndrome
of hemorrhagic fever with levels of virus, organ pathology, and infection of
reticuloendothelial and mononuclear cells comparable to humans. Bowen, E.T.W. et
al., Lancet 1:571-573 (1977).

Two groups of immunized guinea pigs were studied. Animals were injected
intramuscularly with the relevant expression vector plasmids, and the response to
infection in groups immunized with either sGP, GP, NP, or control plasmids was
observed. In the first group, animals were challenged within 2 months after the
initial immunization, at which time the antibody titers were high, ranging from
1:1,600 to >1:25,600 (Table 1A). In these animals, nearly complete protection from
lethal challenge was observed in GP (6/6), sGP (5/6), and NP (4/4) immunized
subjects, in contrast to controls (0/6). In a second group, guinea pigs were
challenged four months after the initial immunization (Table 1B). As in the first
group, all animals immunized with the control vector succumbed to infection within
a week after virus challenge (n=4). In this group, antibody titers were lower, and
three of the four guinea pigs immunized with NP succumbed to infection, with the
single survivor appearing severely ill after 1 week, in contrast to the protective
response with NP at the earlier time point after immunization in Group I. More
effective protection was seen in animals immunized with vector expressing GP:
protection was noted in four of five animals challenged, with one survivor
appearing weak but surviving the viral challenge. Similarly, three of the five
animals immunized with sGP showed no ill effects following viral challenge.
Protection in this group again correlated with the ability to sustain an effective
immune response to GP or sGP. Together, all guinea pigs immunized with vectors
expressing GP or sGP which had titers greater than 1:5,120 were resistant to
infection (11/11) compared to 0/10 controls (p=0, by Fisher's exact test). Twelve
of fourteen animals with antibody titers >1:2,560 survived viral challenge compared
to controls (p=0.00003, by Fisher's exact test). As in immunized mice, guinea pigs
injected with GP or sGP plasmids were able to generate cell-mediated immune
responses to the viral glycoprotein in addition to the antibody response. These
responses were antigen-specific and T cell-dependent, as detected in sGP
antigen-dependent spleen cell proliferation and T-cell growth factor assays
(Fig. 3A-C). Thus, the ability to generate and sustain significant cellular immune
responses in vivo correlated with protection from infection. Though antibody titer
correlated with protection, cell-mediated immunity appeared necessary for
protection, since passive transfer of antibody to GP does not confer protection
(Peters, C.J. et al., Filoviridae: Marburg and Ebola Viruses, in Fields Virology,
(eds. Fields, B.N., Knipe, D.M. & Howley, P.M.) 1161-1176 (Philadelphia,
Lippincott-Raven, 1996); Jahrling, P.B. et al., Arch. Virol. Suppl. 11:135-140
(1996)) and antisera from protected guinea pigs did not inhibit virus replication
in vivo (n=3) or at a 1:10 dilution in vitro (data not shown). Since the Hartley
guinea pig to which the virus has been adapted for growth is outbred, cellular
adoptive transfer studies could not be performed.
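
The two survival comparisons quoted above can be checked with a short
calculation. The following is a minimal sketch, not the authors' analysis: it
assumes a one-sided Fisher's exact test (the text does not say which tail was
used) and takes the counts exactly as reported in the paragraph above (11/11 and
12/14 immunized survivors versus 0/10 controls). The function name is
hypothetical.

```python
from math import comb


def fisher_exact_one_sided(a: int, b: int, c: int, d: int) -> float:
    """One-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]].

    Rows are groups (immunized, control); columns are (survived, died).
    Returns the hypergeometric probability of seeing at least `a`
    survivors in the first row when all margins are held fixed.
    """
    row1, row2 = a + b, c + d
    survivors = a + c
    denominator = comb(row1 + row2, survivors)
    most_extreme = min(row1, survivors)
    numerator = sum(
        comb(row1, k) * comb(row2, survivors - k)
        for k in range(a, most_extreme + 1)
    )
    return numerator / denominator


# Counts as reported in the text (assumed layout: survived, died).
p_high_titer = fisher_exact_one_sided(11, 0, 0, 10)  # titers > 1:5,120
p_mid_titer = fisher_exact_one_sided(12, 2, 0, 10)   # titers > 1:2,560

print(f"11/11 vs 0/10: p = {p_high_titer:.2e}")
print(f"12/14 vs 0/10: p = {p_mid_titer:.2e}")
```

Under these assumptions the second value is about 3.4e-05, in line with the
reported p=0.00003, and the first is about 2.8e-06, which would appear as zero
when printed to only a few decimal places.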



TABLE 1 - Group I

Plasmid        Subject   ELISA (Pre)   ELISA (Post)   Viral Ag   Survival
GP             1         >1:25,600     1:12,800                  Yes
GP             2         >1:25,600     1:25,600                  Yes
GP             3         >1:25,600     1:25,600                  Yes
GP             4         1:25,600      1:6,400                   Yes
GP             5         1:25,600      1:12,800                  Yes
GP             6         1:25,600      1:25,600                  Yes
sGP            1         1:12,800      1:25,600                  Yes
sGP            2         1:6,400       1:25,600                  Yes
sGP            3         1:6,400       1:25,600                  Yes
sGP            4         1:25,600      1:25,600                  Yes
sGP            5         >1:25,600     1:12,800                  Yes
sGP            6         1:1,600       Negative       +          No
NP             1         1:12,800      >1:25,600                 Yes
NP             2         >1:25,600     1:25,600                  Yes
NP             3         1:12,800      1:12,800                  Yes
NP             4         1:25,600      1:25,600                  Yes
Vector alone   1         Negative      Negative       +          No
Vector alone   2         Negative      n.d.           +          No
Vector alone   3         Negative      Negative       +          No
Vector alone   4         Negative      Negative       +          No
Vector alone   5         Negative      n.d.           +          No
Vector alone   6         Negative      n.d.           +          No

Guinea pigs were immunized on days 1, 14, 28, and 42 and challenged on day 62.
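
The per-plasmid survival fractions quoted in the text for Group I (GP 6/6,
sGP 5/6, NP 4/4, controls 0/6) follow directly from the Survival column of the
table. A minimal sketch of that tally is given below; the data structure and
variable names are illustrative assumptions, and only the survival outcomes are
transcribed from the table.

```python
from collections import OrderedDict

# Survival outcomes per plasmid, in subject order, transcribed from
# Table 1, Group I (True = survived the challenge).
group_i_survival = OrderedDict([
    ("GP", [True, True, True, True, True, True]),
    ("sGP", [True, True, True, True, True, False]),
    ("NP", [True, True, True, True]),
    ("Vector alone", [False, False, False, False, False, False]),
])

# Tally survivors and group size for each plasmid.
survival_fraction = {
    plasmid: (sum(outcomes), len(outcomes))
    for plasmid, outcomes in group_i_survival.items()
}

for plasmid, (survived, total) in survival_fraction.items():
    print(f"{plasmid}: {survived}/{total}")
```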






TABLE 1 - Group II

Plasmid        Subject   ELISA (Pre)   ELISA (Post)   Viral Ag   Survival
GP             1         1:2,560       n.d.           +/f        No
GP             2         1:5,120       1:10,240                  Yes
GP             3         1:10,240      1:10,240                  Yes
GP             4         1:1,280       n.d.           +/f        No
GP             5         1:5,120       1:20,480                  Yes (ill)
sGP            1         1:2,560       n.d.           +          No
sGP            2         1:10,240      1:5,120        +/f        Yes
sGP            3         1:10,240      1:81,920                  Yes
sGP            4         1:2,560       1:5,120                   Yes
sGP            5         1:640         n.d.           +          No
NP             1         n.d.          n.d.           +          No
NP             2         n.d.          n.d.           +          No
NP             3         n.d.          n.d.           +          No
NP             4         n.d.          Negative       +          Yes (ill)
Vector alone   1         Negative      n.d.           +          No
Vector alone   2         Negative      n.d.           +          No
Vector alone   3         Negative      n.d.           +          No
Vector alone   4         Negative      n.d.           +          No

Guinea pigs were immunized on days 1, 14, 42, and 112 and challenged on
day 122.

n.d. = not done. Viral Ag denotes presence of virus determined by
immunohistochemistry (30) performed on spleen, liver, lung, kidney, and heart
tissues; "+" = widespread systemic involvement of the mononuclear phagocyte
system and to a lesser extent endothelial and parenchymal cells; "+/f" = focal
involvement (seen in the spleen of sGP #2, the liver and spleen of GP #1, and
the lung of GP #4) where rare sites of anti-Ebola antibody staining were
detected; "-" = no Ebola virus antigen detected in tissues.

ELISA determinations were made prior to viral challenge (Pre) or at least 7 days
after infection (Post), respectively.

The surviving NP immunized animal (4) was found to have significant levels
of virus in major organs by immunohistochemistry, and more than 5 logs of
virus was detected in the serum and spleen, in contrast to GP and sGP
animals where no virus was detected.



Histopathologic analysis of infection in immunized guinea pigs.
Pathologic analysis revealed widespread tissue necrosis and dissemination of virus
by immunohistochemistry, similar to human disease. Virus load correlated with
susceptibility to infection with titers of >:10^ in infected animals compared to
undetectable levels in immunized survivors. In infected animals, the liver, lung, and
spleen showed evidence of significant viral antigen by immunohistochemistry (Fig. 4,
Table 1), and both reticuloendothelial and mononuclear phagocytic involvement was
observed.

Determination of antibody response in animals which survived virus challenge
revealed increases in the immune response to viral proteins when initial titers were
lower (Table 1). Less consistent increases in antibody titers were observed in the NP
immunized animals. These data suggest that Ebola virus infection may stimulate
immunity in survivors of a viral challenge when immune responses are not optimal.
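
The change in antibody titer after challenge can be expressed as a fold change
of the reciprocal titers. The sketch below is illustrative only: it uses the
Group II survivors from Table 1 for which both pre- and post-challenge values
are numeric, as transcribed in the table, and the helper name and dictionary
layout are assumptions rather than part of the original analysis.

```python
def reciprocal_titer(titer: str) -> int:
    """Convert a titer string such as '1:5,120' to the integer 5120."""
    return int(titer.split(":")[1].replace(",", ""))


# (pre-challenge, post-challenge) ELISA titers for Group II survivors
# with both values reported numerically in Table 1.
group_ii_survivors = {
    "GP 2": ("1:5,120", "1:10,240"),
    "GP 3": ("1:10,240", "1:10,240"),
    "GP 5": ("1:5,120", "1:20,480"),
    "sGP 2": ("1:10,240", "1:5,120"),
    "sGP 3": ("1:10,240", "1:81,920"),
    "sGP 4": ("1:2,560", "1:5,120"),
}

# Fold change = post-challenge reciprocal titer / pre-challenge reciprocal titer.
fold_change = {
    animal: reciprocal_titer(post) / reciprocal_titer(pre)
    for animal, (pre, post) in group_ii_survivors.items()
}

for animal, fold in fold_change.items():
    print(f"{animal}: {fold:g}-fold")
```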

II. METHODS

Plasmids. Plasmids containing the GP, sGP, or NP cDNAs (Sanchez, A. et
al., Virus Res. 29:215-240 (1993), Genbank) were used to subclone the relevant
inserts into CMV expression vectors which utilized the bovine growth hormone
polyadenylation sequence. Manthorpe, M. et al., Hum. Gene Ther. 4:419-431 (1993)
(see Figures 5-9 and SEQ ID NOS: 1-4). Briefly, for GP, plasmid pGEM-3Zf(-)-GP
was digested with EcoR I, treated with the Klenow fragment of E. coli DNA
polymerase, and digested with BamH I. The GP fragment was then inserted into the
pCMV expression vector plasmid that had been digested with BamH I, treated with
Klenow fragment, and digested with Bgl II. For sGP, the plasmid pCRII-sGP was
digested with EcoR I and treated with Klenow enzyme, and the resulting fragment
was inserted into the BamH I/Bgl II CMV plasmid which had been incubated with
Klenow fragment and calf intestinal phosphatase (CIP) and then phenol-chloroform
extracted. For the NP expression vector, plasmid pSP64-NP2 (Sanchez, A. et al.,
Virology 170:81-91 (1989)) was digested with EcoR I, treated with Klenow enzyme,
and digested with BamH I. The NP insert was cloned into CMV treated with BamH I
and Klenow enzyme, followed by heat inactivation and Bgl II digestion.

Cell lines and transfectants. For stable transfectants, the relevant cDNAs
were inserted into a CMV expression plasmid containing a neomycin resistance gene,
pCMV-neo (H. Arai, unpublished data), which was digested with Xba I and treated
with CIP and Klenow enzyme. The EcoR I/BamH I GP fragment from pGEM-3Zf(-)-GP,
the EcoR I sGP fragment from pCRII-sGP, or the EcoR I/BamH I NP fragment
from pSP64-NP2 was treated with Klenow enzyme and ligated to this plasmid
backbone. These vectors were transfected into Renca or CT26 cells, which are
syngeneic to Balb/C mice, using calcium phosphate and selected in 0.7 or 1 mg/ml
G418 for 2-6 weeks. Expression of GP, sGP, or NP from these vectors in Renca or
CT26 cells was also confirmed by Western blot analysis (data not shown).

Cell proliferation assay. Spleen cells from male Hartley guinea pigs or
Balb/C female mice (8-10 weeks) immunized with the indicated plasmid expression
vectors were incubated with sGP or vector control supernatants (25% volume:volume)
from transfected 293 cells at the indicated cell concentrations. T cell depletion was
performed using the CT5 monoclonal antibody (Tan, B.T.G. et al., Hybridoma
4:115-124 (1985)) (Biosource, Camarillo, CA) for guinea pigs or anti-Thy 1.2
antibody in the mouse using immunomagnetic microbeads (Miltenyi Biotec, Inc.,
Auburn, CA).

Viral challenge in guinea pigs. Animals were immunized by injection of 100
µl (0.5 mg/ml) in each hind leg (two injections at each time point) with the indicated
plasmid expression vectors. Animals were challenged by inoculation with a stock of
Ebola virus (Zaire, 1976) that had been passaged once in Vero E6 cells and serially
passaged by intraperitoneal injection of spleen homogenates in Hartley guinea pigs
seven times. Immunized guinea pigs were injected intraperitoneally with 0.5 ml of a
1:1,000 dilution of spleen cell homogenate in Hank's balanced salt solution 122 days
after the initial plasmid DNA injection (1000 pfu). Survival was determined 10 days
later, at which time animals were sacrificed for serologic and pathologic analysis.
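
The quantities implied by the immunization and challenge volumes above can be
worked through as simple arithmetic. The sketch below rests on two stated
assumptions that are not spelled out in the text: that "two injections at each
time point" means one 100 µl injection per hind leg, and that the 1000 pfu
figure refers to the virus delivered in the 0.5 ml challenge inoculum. Function
names are hypothetical.

```python
def plasmid_dose_ug(volume_ul: float, conc_mg_per_ml: float) -> float:
    """Plasmid mass in micrograms; 1 mg/ml is the same as 1 µg/µl."""
    return volume_ul * conc_mg_per_ml


def undiluted_titer_pfu_per_ml(
    pfu_delivered: float, volume_ml: float, dilution_factor: float
) -> float:
    """Back-calculate the titer of the undiluted homogenate."""
    return pfu_delivered / volume_ml * dilution_factor


# Assumption: one 100 µl injection at 0.5 mg/ml into each hind leg.
dose_per_injection_ug = plasmid_dose_ug(100, 0.5)
dose_per_time_point_ug = 2 * dose_per_injection_ug

# Assumption: 1000 pfu were delivered in 0.5 ml of a 1:1,000 dilution.
stock_titer = undiluted_titer_pfu_per_ml(1000, 0.5, 1000)

print(f"Plasmid per injection: {dose_per_injection_ug:g} µg")
print(f"Plasmid per time point: {dose_per_time_point_ug:g} µg")
print(f"Implied homogenate titer: {stock_titer:.1e} pfu/ml")
```

Under these assumptions each animal receives 50 µg of plasmid per injection and
100 µg per time point, and the undiluted spleen homogenate would contain on the
order of 2 x 10^6 pfu/ml.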

ELISA. Enzyme-linked immunosorbent assays (Volchkov, V.E. et al., FEBS Lett.
305:181-184 (1992); Sanchez, A. et al., Virus Res. 29:215-240 (1993)) on infected
cell supernatants and enriched viral extracts containing GP, sGP, or NP were
performed as previously described.

III. DISCUSSION

Following the initial report that injection of plasmid DNA into muscle could
direct the synthesis of recombinant proteins (Wolff, J.A. et al., Science 247:1465-1468
(1990)), the suggestion was made that this gene transfer approach may be useful for
vaccination and was termed genetic immunization. Tang, D.C. et al., Nature
356:152-154 (1992). This approach has been applied to different infectious diseases,
including influenza (Ulmer, J.B. et al., Science 259:1745-1749 (1993); Raz, E. et al.,
PNAS (USA) 91:9519-9523 (1994)), malaria (Doolan, D.L. et al., J. Exp. Med.
183:1739-1746 (1996); Sedegah, M. et al., PNAS (USA) 91:9866-9870 (1994)), and
tuberculosis (Tascon, R.C. et al., Nat. Med. 2:888-892 (1996)) and has also been used
to modulate antibody and cell-mediated immune responses in autoimmune and allergic
diseases. Raz, E. et al., PNAS (USA) 90:4523-4527 (1993); Waisman, A. et al., Nat.
Med. 2:899-905 (1996); McCormick, J.B. et al., J. Infect. Dis. 147:264-267 (1983);
Border, W.A. et al., Nat. Med. 1:1000-1001 (1995).

The immune response to selected Ebola virus proteins after genetic
immunization in mice was analyzed and their ability to protect against lethal infection
in a susceptible animal model, the guinea pig, was tested. The immune analyses
performed in different species suggest similar patterns of response, though the
specific peptides which may be recognized by the immune system to confer protection
in the guinea pig could differ from the mouse. Because the principles of MHC antigen
presentation and recognition apply broadly across species (Monaco, J.J., Immunol.
Today 13:173-179 (1992); Jorgensen, J.L. et al., Annu. Rev. Immunol. 10:835-873
(1992); Zinkernagel, R.M. et al., Immunol. Today 18:14-17 (1997)), the finding that
protection was observed in different members of an outbred strain and that similar
immune responses were seen in different species is not unexpected and suggests
that this approach may be applicable to humans.

Immunization with plasmids encoding distinct viral proteins induced different
antibody and cytolytic T cell responses. The broadest immune response was
conferred by GP and sGP, which induced both cellular and humoral immunity to the
membrane-associated GP. In guinea pigs challenged with doses of virus that are
otherwise lethal, sGP provided nearly equivalent protection to GP, with no significant
difference between these groups. The ability of vectors expressing GP to confer
immunity may be explained by the generation of lower molecular weight degradation
products (Fig. 1B) which could provide sufficient protein for antigen presentation to
induce detectable cellular and humoral immune responses in guinea pigs.

Despite the fact that plasmid DNA injection has been shown to affect the
immune response to different antigens in infectious and autoimmune diseases, the
ability of individual gene products to protect against disease in vivo is not readily
predictable. In particular, the rapid rates of Ebola virus replication and the poor
immunogenicity of its proteins had previously rendered it resistant to immune
interventions. Several attempts to confer protection with passive transfer of
immunoglobulin were unsuccessful (Peters, C.J. et al., Filoviridae: Marburg and Ebola
Viruses, in Fields Virology, (eds. Fields, B.N., Knipe, D.M. & Howley, P.M.) 1161-1176
(Philadelphia, Lippincott-Raven, 1996); Jahrling, P.B. et al., Arch. Virol. Suppl.
11:135-140 (1996)), in agreement with the finding set forth herein that antisera from
protected animals fail to neutralize virus replication in vitro. Previous studies using
formalin-fixed virus or purified viral proteins for immunization have also not proven
effective. Peters, C.J. et al., Filoviridae: Marburg and Ebola Viruses, in Fields
Virology, (eds. Fields, B.N., Knipe, D.M. & Howley, P.M.) 1161-1176 (Philadelphia,
Lippincott-Raven, 1996); Clegg, J.C.S. & Sanchez, A., Vaccines against arenaviruses
and filoviruses, in New Generation Vaccines, (eds. Levine, M.M., Woodrow, G.C.,
Kaper, J.B. & Cobon, G.S.) 749-765 (New York, NY, Marcel Dekker, Inc. 1997).

It is likely that traditional immunization approaches using protein antigens,
vaccinia virus, or inactivated virus do not allow for appropriate uptake and
presentation of viral antigens by dendritic or other antigen-presenting cells to induce
protective immune responses. It has been shown recently that genetic immunization
leads to production of recombinant protein(s) in muscle which are delivered to bone
marrow-derived antigen-presenting cells. Iwasaki, A. et al., J. Immunol. 159:11-14
(1997); Doe, B. et al., PNAS (USA) 93:8578-8583 (1996); Corr, M. et al., J. Exp. Med.
184:1555-1560 (1996). Synthesis of Ebola glycoprotein after gene transfer apparently
allows more efficient processing and presentation and the generation of immune
responses not seen with virus or with viral vectors. GP is a large molecule which
contains both T and B cell epitopes. Although antibody levels provide a surrogate
marker of protection, the fact that passive transfer of antibody did not confer
protection implies that immunoglobulin switching and synthesis is reflective of the T
helper response to GP. Genetic immunization stimulates T helper cells to generate
both CTL and B cell antibody responses to the virus. Although antibody production
confirms effective immunization, a productive T cell response, likely involving TH1
cell stimulation, as shown by the T cell proliferation and CTL assays (Fig. 3), is
needed for effective immunity. Taken together, these studies suggest that
transcription and translation of viral genes in host cells by genetic immunization
induces alternative, more effective processing and antigen presentation which better
stimulates immunity to Ebola virus. Since there are as yet no effective antiviral
agents, the ability to generate protective immunity by vaccination may prove useful
in selected high risk populations, particularly in regions of ongoing outbreaks, and
among medical and laboratory personnel exposed to the virus. Although it remains
important to identify agents which treat acute infection, genetic immunization may
help to limit the spread of this highly lethal infectious disease.

The foregoing discussion discloses and describes merely exemplary
embodiments of the present invention. One skilled in the art will readily recognize
from such discussion, and from the accompanying drawings and claims, that various
changes, modifications and variations can be made therein without departing from the
spirit and scope of the invention as defined in the following claims.

All references cited herein are incorporated by reference as if fully set forth. 
WE CLAIM:

1. A pharmaceutical composition comprising a nucleic acid molecule
encoding an Ebola virus structural gene product operatively-linked to a control
sequence, in a pharmaceutically acceptable carrier.

2. The pharmaceutical composition of Claim 1, wherein the Ebola virus
structural gene product is selected from the group consisting of the transmembrane
form of virus glycoprotein, the secreted form of virus glycoprotein, virus nucleoprotein
and combinations thereof.

3. The pharmaceutical composition of Claim 1, wherein the control
sequence is a promoter.

4. The pharmaceutical composition of Claim 3, wherein the promoter is
the CMV immediate-early region 1 promoter.

5. The pharmaceutical composition of Claim 1, further comprising an
adjuvant.

6. The pharmaceutical composition of Claim 2, wherein the structural gene
product is the transmembrane form of virus glycoprotein.

7. The pharmaceutical composition of Claim 2, wherein the structural gene
product is the secreted form of virus glycoprotein.

8. The pharmaceutical composition of Claim 2, wherein the structural gene
product is virus nucleoprotein.

9. A method of producing a vaccine against disease caused by infection
by Ebola virus, comprising the steps of:

a) administering the pharmaceutical composition of Claim 1 to a test host
to determine an amount and a frequency of administration thereof to elicit a protective
immune response in said host; and

b) formulating said pharmaceutical composition in a form suitable for
administration to a treatable host in accordance with said determined amount and
frequency of administration.

10. A vaccine comprising a nucleic acid molecule encoding the
transmembrane form of the Ebola virus glycoprotein operatively-linked to a control
sequence, in a pharmaceutically acceptable carrier.

11. The vaccine of Claim 10, wherein the control sequence is a promoter.

12. The vaccine of Claim 11, wherein the promoter is the CMV
immediate-early region 1 promoter.

13. The vaccine of Claim 10, further comprising an adjuvant.

14. A vaccine comprising a nucleic acid molecule encoding the secreted
form of the Ebola virus glycoprotein operatively-linked to a control sequence, in a
pharmaceutically acceptable carrier.

15. The vaccine of Claim 14, wherein the control sequence is a promoter.

16. The vaccine of Claim 15, wherein the promoter is the CMV
immediate-early region 1 promoter.

17. The vaccine of Claim 14, further comprising an adjuvant.

18. A vaccine comprising a nucleic acid molecule encoding the Ebola virus
nucleoprotein operatively-linked to a control sequence, in a pharmaceutically
acceptable carrier.

19. The vaccine of Claim 18, wherein the control sequence is a promoter.

20. The vaccine of Claim 19, wherein the promoter is the CMV
immediate-early region 1 promoter.

21. The vaccine of Claim 18, further comprising an adjuvant.

22. A method of immunizing a subject against hemorrhagic fever
comprising the step of administering to the host an immunoeffective amount of the
vaccine of any of Claims 10 to 21.

23. The method of Claim 22, wherein the hemorrhagic fever is caused by
infection with Ebola virus.

24. The method of Claim 22, wherein the hemorrhagic fever is caused by
infection with Marburg virus.

25. The method of Claim 22, wherein the host is a human and
administration is by intramuscular injection.

26. The method of Claim 22, wherein the subject receives a second
administration of an immunoeffective amount of a vaccine against disease caused by
infection by Ebola virus or Marburg virus.

General Description

DNA: pVR1012-GP(IC)
Local object
Created: 09/14/98 04:17 PM
Last Modification Date: ? (no data)
Length: 7003 bp
Storage type: Basic
Form: Circular
Comments

Restriction Map

1 site:  BglII, ClaI, DraIII, EcoRV, HindIII, HpaI, KasI, PmlI, PstI, PvuI,
         SacII, SalI
2 sites: NcoI, NdeI, SphI, XhoI
3 sites: BclI

Functional Map

CDS (4 signals)
  CMV IE 5' UT    Start: 896    End: 1129
  CMV IE INT      Start: 1130   End: 1840
  TbGH            Start: 4020   End: 4572
  Kan r           Start: 6068   End: 6690 (Complementary)
Misc_feature (2 signals)
  CMV enhancer    Start: 248    End: 895
  GP(IC)          Start: 1870   End: 4019

Annotations
1 TCCCGCGTTT CGCTGATQAC GGTGAXAACC TCTOACACAT CCAGCTCCCG
XOCGCGCAWl GCCACTACTG CCACTTTTGG AflACTCTCTA CGTCGAGCCC 



51 GAGACCOTCA CAGCTTCTCT GTAAGCG3AT CCCCaOAQCA QACAACCCCG 
CTCTGCCAGT GXCGAACAGA CATTCCCCTA CGGCCCTCGT CTGTTCOGGC 



101 TCAGCGCGCG TCAGCCGGTG TTGGCGGGTG TCGGGGCTGG CTTAACTATG 
AOTCCCGCGC AGTCGCCCAC AACCGCCCAC AGCCCCGACC CAATTGATAC 



151 CGGCATCAGA GCAGATTGTA C7GAGAGTCC ACCATATGCO GTCXCAAATA 
GCCGTACTCC CCTCTAACAT CACICTCACG TGGTATACCC CACACTTTAT 



201 CC6CACAGAT CCGTAAGGAG AAAATACCGC ATCAGATTGC CTATTGGCCA 
GGCGTCrCTA CGCATTCCTC TTTTATGGCG TAGTCTAACC GATA&CCGGT 



251 TTGCATACGT TOTATCCATA TCATAATATG TACATTTATA TTGCCTCATG 
AACGTATGCA ACATAGGTAT AGTATTATAC ATGTAAATAT AACCGAGTAC 



301 TCCAACATTA CCGCCATCTt? GACATTGATT ATTGACTAGX TATTAATACT 
AGG7TGTAAT GOCGGTACAA C^XSTAACTAA TAACTGATCA ATAATTATCA 



351 AATCAATTAC GGGG7CATTA GTTCA^AGCC CATATAVCGA. GTTCCGCGTT
TTAGTTAATG CCCCACTAAT CAAG7ATCGG CTATATACCT CAAGGCGCAA 



401 ACATAACTTA CGGTAAATGG CCCGCCTGGC TGACCCCCCA ACGACCCCCG 
TCTATTGAAT GCCATTTACC GGGCGGACCC ACTGGCGGGT TGCTCCCCCC 



451 CCCATTGACG TCAATAATGA CGTATGTTCC CATAGTAACG CCAATAGGGA 
GGGTAACTGC AGTTATTACT GCATACAAGG GTATCATTGC GGTTATCCC? 



501 CTTTCCATTG ACGTCAATGG GTGGA3TATT TACGCTAAAC TGCCCACTTG 
GAAAGGTAAC TGCAGTTACC CACCTCATAA ATGCCATTTG ACGGGTGAAC 



551 GCAGTACATC AACTGTATCA TAfCCCAAGT ACGCCCCCTA TTGACGTCAA
CGTCATCTAO TTCACATAGT ATACGGTTCA XCCGGGGGAT AACTCCACTT 



601 TGACGGTAAA TGGCCCGCCT caCATTAXGC CCAGTACA7G ACCTTATCCG 
ACTGCCATTT ACCGGCCCGA CCGTAATACG CGTCATGTAC TCGAATACCC 



651 ACTTTCCTAC TTGGCAGTAC ATCTACGTAT TAGTCATCGC TATTACCATG
TCAAAGGATC AACCGTCATG TAGATGCATA ATCAGTAGCC ATAATGGTAC 



701 GCGATGCGGT TTTGGCAGTA CATCAATGGG CGTGGATACC GCTTIGACTC 
CACTACCCCA AARCCCTCAT GTAGTTACCC GCACCTATCG CCAAACTGAC 



751 ACGGCCATTT CCAAGTCTCC ACCCCATTGA CCTCAATGGG AGTTTGrPPT 
TGCCCCTAAA GCXTCAGAGG TGGGCTAACT GCAGTTACCC XCAAACAAAA 



801 GGCACCAAAA tXUkACGGGAC TTTCCAAAAT CTCGTAACAA CTCCCCCCCA 
CCGTCCTTTT AGTTGCCCTG AAAGGTTTTA CAGCATTCTT GAGGC(SGGGT 
851 TTGACGCAAA. TCGGCGGTAG GCCTGTACGG TGGGAGGTCT ATATAACCAQ
AXCTGCGTTT ACCCGCCATC CCCACATGCC ACCCTCCACA TATATTCGTC 



901 ACCTCCTTTA GTCAACCGTC AGATCGCCTC CAGACQCCAT CCACSCTCTT
TCGAGCAAAT CACTTGGCAG TCTAGCGGAC CTCTGCGGTA GGTGCGACAA 



SacII



951 TTCaCCTCCA TASAAOACAC CGGGACCGAT CCAGCCTCCG CCGCCCCGAA 
AACTGGAG3T ATCTTCTGTG GCCCTCCCTA GGTCGGAGGC CCCCGCCCTT 



1001 CCGTOCATTG GAACCCGGAT TCCCCCTCCC AAGAGTGACG TAACTACCGC 
CCCACGTAAC CTTGCGCCTA AGGCCCACGG TTCTCACTCC ATTCATGGCG 



SphI



1051 CTATAGACTC TATAGGCACA CCCCTTTGGC TCTTATGCAT GCTATACTCT 
GXTATCTGAG ATATCCGTGT GGGGAAACCG AGAATACGTA CCIATATGACA 



1101 TTTtaSGCTTG CCCCCTATAC ACCCCCGCTT CCTTATGCTA TAGGTGATGG 
AAAACCGAAC CCCGGATATG TCGGGGCGAA GGAATACGAT ATCCACTACC 



1151 TATAGCTTAG CCTATAGGTG TCGGTTATTG ACCATTATTG ACCACTCCCC 
ATATCGAATC GGATATCCAC ACCCAATAAC TGCTAATAAC TGGTGACCCG 



1201 TATTGCTGAC CATACT7TCC A-TTACTAATC CATAACATCC CTCTTTGCCA 
ATAACCACTG CTATGAAACG TAA7GATTAG GTATTGTACC GAGAAACCGT 



1251 CAACTATCTC TATTGGCTAT ATGCCAATAC TCTGTCCTTC AGAGACTGAC 
GTTCATAGAG ATAACCGATA TACGGTTATG AGACAGGAAG TCTCTGACTG 



1301 ACGGACTCTG TATTTTTACA GGATGGGGTC CCATTTATTA TTTACAAATT 
TCCCTGAGAC ATAAAAATGT CCTACCCCAG GGTAAATAAT AAATGTTTAA 



1351 CACATATACA ACAACGCCGT CCCCCCTCCC CGCAGTTTTT ATCAAACATA 
GTCTATATGT TGTTCCGGCA GGGGGCACGG GCGTCAAAAA TA7iTTTGTAT 



1401 GCGTGCGATC TCCACCCGAA TCTCGGGTAC GTGTTCCGCA CATGGGCTCV
CCCACCCTAG AGGTGCGCTT AGAGCCCATG CACAACCCCT GTACCCGACA 



1451 TCTCCGGTAG CGGCGGAGCT TCCACATCCG AGCCCTGGTC CCATGCCTCC 
AGAGGCCATC GCCGCCTCCA AGGTGTACGC TCGGGACCAG CGTACGGAGG 



1501 AGCCGCTCAT GGTCCCTCGG CAGCTCCTTC CTCCTAACAC TGGAGGCCAO
TCCCCGAGTA CCAGCGAGCC GtCCAGGAAC GAGGATTGTC ACCTCCGCTC

1551 ACTTAGGCAC ACCACAATCC CCAOCACCAC CASTCrCCCG CACAACGCCa
TGAATCCGTO TCGTGTTACC CGTGGTGGTG GTCACACGGC CTCTTCC6GC

1601 TGGCGGTAGG CTATGTGTCT GAAAATGAGC GTGGACATTG GGCTCGCACG
ACCCCCATCC CATACACACA CTTTTACICG CACCTCTAAC CCOAGCGTGC

1651 CCTGACGCAG ATGGAAGACT TAAGGCAGCG GCAGAAGAAG ATGCACGCAG
CGAC7GCCTC TACCTTCTCA ATTCCGTCGC CGTCTTCTTC TACGTCCGTC

1701 CTGAGTTGTT GTATTCTGAT AAGAGTCACA GGTAACTCCC
GftCTCAACAA GATAACACTA TTCTCAGTCT CCATTCAGGG CAACGCCACG

HpaI

1751 TGTTAACGGT GGAGGCCAGT GTACTCTGAG CACTACTCG? TGCTC3CCCCG
ACAATTGCCA CCTCCCGTCA CATCAGACTC CTCA.TGAGCA ACGACGGCGC

NcoI

1801 CGCGCCACCA GACATAA.TAG CTGACAGACT AACAGWJTGT TCCTTTCCAT
GCGCGCTGGT CTCTATTATC GACTCTCTGA TTGTCTCACA AGGAAACCTA

SalI

NcoI PstI PmlI BclI EcoRV

1851 GGGTCTTTTC TGCAGTCACC GTCGTCGACA CGTQTGATCX GATATCGCGG
CCCAGAAAAG ACGTCACTGG CAGCAGCTGT GCACACTAGX CCATAGCGCC

EcoRI

1901 CCGCGCGGCC GCTCTAGAAT TCTCTAATCA CAGTCATCAT G3GAGCGTCA
GCCGCGCCGG CCAGATCTTA ACAGATTAGT GTCAGTAGTA CCCTCCCAGT

1951 GGGATTCTGC AATTGCCCCG TGAGCGCTTC AGCAAA;u=AT CTTTCrTTGT
CCCTAAGACG TTAACGGGGC ACrcGCGAAC TccrrrrGTA caaaqaaaca

2001 TTGGGTAATA ATCCTATTCC ATAAAGTCTT TTCAATCCCG TTGGGGGTTG
AACCCATTAT TAGGATAAGG TATTTCACAA AAGTTAGGGC AACCCCCAAC

2051 TACACAACAA TACCCTACAA GTGAGTGATA SrCACAAGTT TCTCTGCCGA
ATGTGTTCTT ATGG6ATGTT CACTCACTAT AACTGTTCAA ACACACGGCT

2101 GAC7AACTCT CTTCAACTAG CCAATTGAAC TCAGTCGGGT TGAACTTGGA
CTGTTTGAGA GAACTTGATC GGTTAACTTC AGTCACCCCA ACTTGAACCT

2151 GGGCAATGGA CTACCAACTG ATGTACCAAC GCCAACCAAA AGATCCCGTT
CCCGTTACCT CATCGTTGAC TACATGGTTG CCGTTGGTCrr TCTACCCCa^

2201 TTCGAGCTGG TGTTCCACCA AAGGTGGTAA ATTACGAAGC raGAaVLTGG
AAGCTCGACC ACAAGGTGGT TTCCACCATT TAATGCTTCG ACCTCTTACC

2251 GCTGAGAACT GTTATAACCT GGCTATAAAG AAAGTTGATG GTAGTGAGTG
CGAC7CTTGA CAATATTGGA CCGATATTTC TTTCAACTAC CATCACTCAC

2301 CCTACCAGAA GCCCCTGAGG GACTGAGGGA TTTTCCCCQT TGCCCCTATG
GGATGGTCTT CGGGGACTCC CTCACTCCX:T AAAAGGGGCA ACGGCGATAC

2351 TACACAAAGT CTCAGGAACT GGACCATGCC CAGGAGGACT CGCCTTTCAC
ATGTGTTTCA GAGTCCTTOA CCTGGTACGG CTCCTCCTGA GCGGAAAGra

2401 AAAGAAGCAG CCTTCTTCCT GTATGACCCA CTCGCATCAA CAATCATTTA
TTTCTTCCTC GGAAGAAGGA CATACTGGCT GAGCGTACTT GTTAGTAAAT

2451 TCGGGCTACA ACCTTTCCCa AAGGAGTTAT TOCATTTCTG ATCTTGCCTA
ACCCCCATGT TCGAAACGGC TTCCTCAATA ACGTAAAGAC TAaiiACGCAr

2501 AGGCGCGAAA GGATTTTTTC CAGTCTCCTC CATTGCATQA GCCTCCCAAC
TCCGCGCTTT CCTAAAAAAG GTCAGACGAG GTAACGTACT CGGACGGTTG
2551 A.TGACCACGG ATCCCTCCAO TTACTATCAC ACGXCAACAA TXAACTACCT
TACTGCTCCC TAQOGAGCTC JLfcTGATAGTG TGCTCTTGTT ATTTGATGCA 

2601 6GTTGATAA.T TTTGGAACCA ACACCACXGA erXTCTCTTC CAAGTCGATC
CCAACTATTA AAACCTTGGT TGTGGTGTCT CAAAOACAAG QTTCAGC5AC 



2651 ATTTGACGTA TQTGCAGCTC CACCCAAGAT TCACACCACA ATTCCTTGTC 
TAAACICCAT ACACGTCGAC CTCCGTTCTA AGTCTGGTGT TAAGGAACAG 



2701 CTCCTAAATG AAACCATCTA CTCTGATAAC CGCAGAACTA ACACAACAGG 
QAOGATTTAC TCTGGTACAT GAGACTATTG CCCTCTTCAT TOTGTTGTCC 



2751 AAAACTAATC TGGAAAATAA ATCCCACTGT TCATACCAGC ATGGCTCACT
TTTTGATTAG ACCTTTTATT TAGGGTCACA ACrATGGTCC TACCCACTCA 



2801 CCCCTTTCTG GCAAAATAAA AAAACTTCAC AAAAACCCTT TCAAGTGAAG 
CCCGAAAGAC CCTTTTATTT TTTTGAAGTG TTTCTGCCAA AffT^TCACTTC 



2851 AGTTGTCTTT CCTACCTQTA CCAGAAACCC AGAACCAGGT CCTTGACACG 
TCAACAGAAA GCATGGACAT GGTCTTTGGG TCTTCCTCCA GGAACTGTGC 



2901 ACAGCGACGG TCTCTCCTCC CATCTCCGCC CACAACCACC CAGGC GAAG A 
7CTCGCTCCC AGAGAGGACC GTAGAGGCGG GTCTTGGTGC GICCGCTICT 



2951 CCACAAAGAA TTGGTTTCAG AGGATTCCAC TCCAGTGGT7 CAGATGCAAA 
CCTGTTTCTT AACCAAAGTC TCCTAAGGTG AGGTCACCAA GTCTACGTTT 



3001 \CATCAACGG AAAGGACACA ATGCCAACCA CAGTGACGGG TGTACCAACA
TCTACTTCCC TTTCCTGTGT TACCGTTGGT GTCACTGCCC ACATGGTTGT 



3051 ACCACACCCT CTCCATTTCC AATCAATGCT CCCAACACTG ATCATACCAA
TGGTGTGGGA GAGGTAAAGG TTAGTTACGA GCGTTGTGAC TACTATGGTT 



3101 ATCATTTATC CCCCTGGAGG GGCCCCAAQA AGACCACACC ACCACACAOC 
TAGTAAATAC CCGGACCTCC CCGGGGTTCT TCTCGTGTCG TGGTGTCTCG 



3151 CTCCCAAGAC CACCAGCCAA CCAACCAACA GCACACAATC GACGACACTA 
GACGGTTCTC GTGGTCGGTT GGTTGGTTGT CGTGTCTTAG CTCCTGTGAT 



3201 AACCCAACAT CAGAGCCCTC CAGTAGAGGC ACGGOACCAT CCACCCCCAC 
TTCGGTTGTA CTCTCGGGAG GTCATCTCCG TGCCCTCCTA GGTCGGCCTC 



3251 GGTCCCCAAC ACCACAGAAA CCCACGCCGA AffTTOGCAAG ACAACCCCAA 
CCAGGGGTTC TCCTGTCTTT CGGTCCOOCT TOAACCCTTC TGTTGGGCT* 



3301 CCACACTCCC AGAACAGCAC ACTGCCQCCA CTGCCATTCC AAGAGCCCTQ 
GGTCTGAGGG TCrfGTCGTG TGACCGCGGT CAC6GTAACG TTCTCGGCAC 



3351 CACCCCGACG AACTCAGTGG ACCTGGCCTC CrGACCAACA CAATACCGGG 
GTGGGGCTGC TTGAGTCACC TGGACCGAAG CACTGCTTGT CTTATGCCCC 
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BaaKI 



3401 


GGTQACAAAT CTCCTQACAG GATCCACAXG AAAGCGAAGG GATGTCACTC 
CCACTCTTTA GAGGAC7CTC CTAGGTCTTC TTTCGCTTCC CTACAGTGAG 


3451 


CCAXTACACA ACCCAIU^TGC AACCCAAACC TGCACTATTG GACACCCTTG 
CCTTATGTGT TGGCTTXACG TTGGGTTTGG ACCTCATAXC CTOTCaORAC 


3501 


QA.7GAaaGTG 
CTACTCCCAC 


CTGCCATAGG TTTAGCCTOG ATACCATACT TCGOOCCAGC 
OACGOTATCC AAATCCCACC TATGGTATGA AGCCCGGTCG 


35S1 


AGCTGAGGGA ATTTAty^CTG AAGGCATAXT CGAGAJlJrCAX AATGGATTCA 

TCGACTCCer taaxtgtgac ttccctatta cctcttagtt ttacctaact 


3601 


TCTGTGGATT GAGGCAGCTG GCCAACCWLA. COACACAAGC TCTTCAXTTC 
AGACACCTAA, CTCCGTCGAC CGCTTCCTTT GCTGTGTTCG AGAACTTAAC 


3651 


TTCTTAAGCC 
AACAAirCCC 


CAACTACTGA GTTGCCTACA 1TCTCTATAC TAAATCCGAA 
GV^TGATGACT CAACGCATGT AAGAGATATG ATTTAGCCTT 


3701 


AGCAATAGAC 
TCCTTATCTG 


TTCTTGCTCC AAACATGGGG AGGAACATGT CACATTCTAG 
AAOAACGACG TTTCTACCCC TCCTTGTACA GTGTAAGATC 


3751 


GGCCTCA^G 
CCCGACTAAC 


TTGCATTGAA CCCCAAGATT GGACCAAAAA TATCACTGAT 
AACGTAACTT GGGGTTCTAA CCTCOTTTTT ATAGCGACTA 


3801 


Bell 

AAAATTGATC 
TTTTAACTAG 


AAATAATCCA TflACrrTCTC GATAATAATC TTCCAAATCA 
TTTATTAGGT ACTGAAACAG CTATTATTAG AAGGTTTAGT 


3851 


GAATGATCGC 
CTTACTACCG 


AGCAACTGCT GGACTGGATG GAAACAATGG GTTCCTGCTG 
TCCTTGACCA CCTGACCTAC CTCTGTTACC CAAGGACGAC 


3901 


GAA7AGGAAT 
OTTATCCTTA 


CACACGAGTA ATCATTCCTA TTATTGCTTT GCTGTGCATT 
GTGTCCTCAT TACTAACGAT AATAACGAAA CCACACGTAA 


3951 


TCCAAATTCA 
ACGTTTAAGT 


EcoRI 

TGCTTTGAAC TAATATAGCA TCATACTTTA GAATTCTAGA 
ACGAAACTTG ATTATATCGT AGTAT6AAAT CTTAAGATCT 



NarI



KasI BamHI BglII



4001 CCAGCCCCCT GOATCCAGAT CTCCTOTGCC TTCTAGTTCC CAGCCA7CTG 
CGTCCGCGGA CCTAXSGTCTA GACGACy^CGG AAGATCAACG GTCOGTACAC 



4051 TTGTTTGCCC CTCCCCCGTG CCTTCCTTCA CCCTGGAAGG TGCCACTCCC 
AACAAACGCC GAGGGGGCAC CCAASGAACT GGGACCTTCC ACGGTGAGGa 



4101 ACTGTCCTTT CCTAATAAAA TCACGAAATT GCATCGCATT CTCTGAGTAG 
TGACyiGGAAA GGAXXATTTT ACTCCTTTAA CGTACCGTAA CAGACTCATC 



4151 GTG7CATrCT ATTCTGGGGG GTGGGGTCCG CCAGCACAGC AAGGGGGAGC 
CACAGTAAGA TAAGACCCCC CACCCCACCC CGTCGTGTCC TTCCCCCTCC 
SphI KpnI



4201 ATTGSSAAGA. CAATACCAGG CATGCTCGCa ATGCGGTGGG CTCTATOGGT
TAACCCTTCT GTTATCGrTCC GTACCACCCC TACGCCACCC GAGATACCCA 



iqpzxl 

4251 ACCCACCT6C T6AACAATTG ACCC6GTTCC TCCTGGGCCA GAAACAAGCA 
TC6GTCCACC ACTTCTTAAC TCOflCCAAGG AGGACCCCCT CTTTCTTCGT 



4301 CCCACATCCC CTTCTCTGTG ACACACCCTG TCCACGCCCC TOGTTCTTAG 
CCGTGTAGGG GAAGA6ACAC TCTOTGaGAC AGGTCCGGGG ACCAAGAATC 



4351 rrCCAGCCCC ACTCATAGGA CACTCATAGC TCAGCACCCC TCCGCCTTCA 
AAQGTCGGCC TGAGTATCCT CTGAGTATCG ACTCCTCCCG AGGCGGAAGT 



4401 A2CCCACCCC CTAAAGTACT TCGAGCGGTC TCTCCCTCCC TCATCAGCCC 
7AGGCTCCGC GATTTCATCA ACCTCGCCAC AGAGGGAGGG AGTAGTCGQQ 



4451 ACCAAACCAA ACCTAGCCTC CAAGAGTCCC AAGAAATTAA ACCAAGATAG 
TCCTTTGGTT TGGATCGGAG GrTCTCACCC TTCTTTAATT TCGTTCTATC 



4501 GCTATTAAST GCAGAGCGAG AGAAAATCCC TCCAACATCT CACCAA6TAA 
CCATAATTCA CGTCTCCCTC TCTTTTACGG AGGTTGTACA CTCCTTCATT 



XmnI 



4551 TGAGACAAAT CATASAATTT CTTCCGCTTC CTCGCTCACT CACTCGCTGC 
ACrCrCTTTA CTATCTTAAA CAAGGCGAAG GACCCAGTGA CTGAGCGACG 



4601 CCTCGGTCGT TCGGCTGCGG CCACCGGTAT CAGCTCACTC AAAG3CGGTA 
CCAGCCAGCA AGCCGACGCC CCTCGCCATA GTCCAGTGAG TTTCCGCCAT 



4651 ATACCGTTAT CCACAGAATC ACCCGATAAC GCAGGAAAGA ACATGTCACC 
TATGCCAATA GGTGTCTTAC TCCCCTATTG CCtCCTTTCT TGTACACTCG 



4701 AATkAGGCCAG CAAAAGGCCA GGAACCGTAA AAACGCCGCG TTGCTGCCGT 
TTTTCCGGTC GTTTTCCGGT CCTTGGCATT TTTCCGGCGC AACGACCGCA 



4751 rrrrccATAG gctccgcccc cctgacgagc atcacaaaaa tcgacgctca 

AAAAGGTATC CCAGGCGGGG CCACTGCTCG TACTCTTTTT A.GCTGCGAGT 



4801 AGTCAGAGGT 6CCGAAACCC GACA6GACTA TAAACATACC AGGCG77TCC 
TCAGTCTCCA CCGCTTTGGG CTGTCCTGAT ATTTCTATGG CCCGCAAAGG 



4851 CCCTGGAAGC TCCCTCGTGC CCTCTCCTGT TCCGACCCTG CCGCTTACCG 
GGGACCTTCG AGGGACCACO CGACAGGACA AGGCTGCCAC GGCGAATGCC 



4901 CATACCTGTC CCCCTTTCTC CCTTCGGGAA GCGTCGCGCT TTCTCAATGC 
CTATCOACAG GCGOAAAGAO GOAACCCCTT CGGACCGCCA AAGAGTTACC 



4951 7CACCCTGTA CGTATCTCAC TTCGGTGTAC CTCOTTCGCT CCAAGCTGGG 
AGTGCGACAT CCATAGAGTC AAGCCACATC CAQCAAGCCA GGTTCGACCC 



5001 CTGTCTCCAC GAACCCCCCG TTCAGCCCGA CCGCTGCCCC TTATCCGCTA 
GACACACGTG CTTGQGGGGC AAGTCGGGCT GCCGAOSCGG AATAfiGCCAT 
5051 ACTATCCTCT TGAGTCCXXC CCGGTAAGAC ACGACTTXTC GCCACTGGC^
TGA3*CCAGA ACTCACCTTG GGCCATTCTG TGCTGAATAC CGOTQACCGT 



5101 GCACCCACTG GTAACAGGAT TAGCAC*J=CG AGGTAXCTXC GCOaTQCTAC 
CCTCGGTGAC CATTGTCCTA ATCCTCTCGC TCCATACATC CGCCACCATC 



5151 ACaiCTTCTTG AACTGGTGGC CTAACTACGG CTACACTACA AGGACAGTAT 
TC7CAASAAC TTCACCACCG GATTOATGCC CATCTTGATCT TCCTCTCATA 



5201 TTGGTATCTC CGCTCTSCTC AAGCCAGTTA CCTTCGGAAA AAGACTTCOT 
JIACCATAGAC GCCXGACGAC TTCGGCXUlAT GGAAGCCTTT TTCTCAACCA 



5251 AGCTCTTGAT CCGGCAAACA AACCACCGCT GOTAGCGGTG GXTTTTTTGT
TCGAGAACTA CGCCGTTTGT TTGGTGGCGA CCATCGCCAC CAAAAAAACA 



5301 rrCCAAGCAG CAGATTACGC GCAGAAAAAA AGGATCTCAA GAAGATCCTT 
AACGTTCGTC CTCTAATGCG CCTCTTTTTT TCCTACAOTT CTTCTAGGAA 



5351 TCATCTTTTC TACCCGGTCT GACGCTCAGT GGAACGAAfcA CTCACGTTAA 
AtrZAGAAAAC ATGCCCCACA CTGCGAGTCA CCTTCCTTTT GAGTGCAATT 



5401 GGCATTTTGG TCATGACATT ATCAAAAAGC ATCTTCACCS ACATCCTTTT
CCCTAAAACC AC^ACTCTAA TAGTTTTTCC TAGAAGTCGA. TCTAGGAAAA 



5451 AAATTAAAAA rfiAAGTTTTA AATCAATCTA AAGTATATAT CAGTAAACTX
TTTAATTTTT ACTTCAAAAT rPAGTTAGAT TTCATATATA CTCATTTCAA 



5501 CCTCTGACAC TTACCAATGC TTAATCAGTG AGCCACCTAT CTCAGCCATC 
CCAGACTCTC AATGGTTACG AATTAGTCAC TCOSTGGATA GAGTCGC7AG 



5551 TGTCTATTTC GTTCATCCAT AGTTGCCTGA CTCCG6GGGG GGG5SCCGCT 
ACAGATAAAG CAAGTAGSTA TCAACGGACT GAGGCCCCCC CCCCCCGCGA 



5601 GAGGTCTCCC TCGTGAACAA GGTGTTGCTC ACTCATACCA GGCCTCSAATC 
CXCCAGACGG AGCACTTCTT CCACAACGAC TGACTATGGT CCGGACTTAG 



5651 GCCCCATCAT CCAGCCAGAA ACTGAGGGAG CCACGGTTGA TGAGAGCTTT 
CGGGGTACTA GGTCGGTCTT TCACTCCCCC GGTGCCAACT ACTCTCGAAA 



5701 GTTGTACCTG GACCAGTTGG TGATTTTCAA CTTTTGCTTT GCCACGGAAC 
CAACATCCAC CTGGTCAACC ACTAAAACTT GAAAACG&AA CGGTGCCTTG 



5751 GGTCTGCGTT GTCCGGAAGA TGCCTGATCT GATCCTTCAA CTCAGCAAAA 
CCAGACGCAA CAGCCCTTCT ACGCACTAGA CTAGGAAGTT GAGTCGTTTT 



5801 GTTCGATrTA TTCAACAAAG CCGCCGTCCC CTCAAGTCAG CGTAArTGCTC 
CAACCTAAAT AACTTGTTTC GGCCGCAGGG CAGTTCACTC GCATTACGAC 



5851 TGCCAGTGTT ACAACCAATT AACCaATTCT CATOAGAAAA ACTCATCCAC
ACCCTCACAA TCTTCOTTAA TTGGTTAAGA CTAATCTTTT T6AGTAGCTC 



5901 CATCAAATGA AACTGCAATX TATTCATATC AGGATTATCA ATACCATATT 
GTAGTTTACT TTGACCTTAA ATAAGTATAC TCCTAATAGT TATCGTATAA 



5951 7TTGAAAAAG CCGTTTCTCT AATGAAGGA6 AAAACTCACC GAGGCACTTC
AAACTTTTTC GGCAAACACA T7ACTTCCTC «rTTGAGTGG CTCCGTCAAG 



6001 CX-meSATCG CAACATCCTO GTATCCGTCT GCGATTCCCA CTCGTCCAAC
GOaaCCTACC GTTCTAGGAC CATAGCCAGA CCCTAAGQCT GAGCACGTTC 



6201 AAAATCACrC CCATCAACCA AACCGTTATT CATTCOTGAT TGCGCCTCAC
TTTTAOTOAQ CGTAGTTCGT TTGGCAATAA CTAAGCACTA ACCCGGACTC 



6251 CCA£5ACGAAA TACGCGATCS CTGTTAAAAG GACAATTACA AACAGGAATC
GCTCTGCTTT ATGCGCTACC GACAA'TTTTC CTGTTAATGT TTGTCCTTAG 



6601 ACATTATCCC GAGCCCATTT ATACCCATAT AAATCAGCAT CCATGTTGGA
TGIAATAGCG CTCGJWSTAAA TATOCCTATA TTTAGTCCTA GGTACAACCT 



6651 ATTTAATCGC CCCCTCGAGC AAGACGTTTC CCCTTOAATA TGCCTCATAA 
TAaATTACCG CCGGAGCTCG TTCTGCAAAQ GGCAACTTAT ACCGAGTATT 



6701 CAOCCCrrCT ATTACMTTT atotaaccag acagttttat tgttcatcat 

GTCCGQAACA TAATOACAAA 9ACATTCGTC TGTCAAAATA ACAAGTACTA 



6751 GMATATrrP TATCTTCTCC AATGTAACAT CAGAGATTTT GAGACACAAC 
CTATATAAAA ATACAACACG TTACATTGTA GTCTCTAAAA CTCTCTCTTG 
General Description

DNA pVR 1012-GP(S)
Local object

Created: 09/14/98 03:58PM

Last Modification Date: ? (no data)

Length: 7073 bp

Storage type: Basic

Form: Circular



Restriction Map

BalI: 1 site
BclI: 1 site
ClaI: 1 site
DraIII: 1 site
HindIII: 1 site
KasI: 1 site
KpnI: 1 site
NarI: 1 site
PmlI: 1 site
PvuI: 1 site
SacII: 1 site
SalI: 1 site
XbaI: 1 site
XmnI: 1 site cttnnnnaag
NdeI: 2 sites
EcoRV: 3 sites
SphI: 3 sites
BamHI: 6 sites

Functional Map
CDS (4 signals)
CMV IE 5' UT

Start: 886 End: 1129
CMV IE INT

Start: 1130 End: 1940
TbGH

Start: 4090 End: 4642

Kan r

Start: 6138 End: 6760 (Complementary)

Misc_feature (2 signals)
CMV enhancer

Start: 248 End: 885
GP(S)

Start: 1370 End: 4089



Annotations 
1


■rCCCaCGTTT 
ACSCGCCCAAA 


CGGTGATCAC 
GCCACTACTG 


GGTGAAAACC TCTGACACAT GCAGCTCCCC 
CCACTTTTGG ACACTGTGTA CCTCCAOGGC 


51 


GAGACGGCCA 
CTCTGCCAGT 


CA6CTTGTCT 
GTCQAACACA 


GTAAGCGOAT GCCGGOAGCA GACAACCCCG 
CATTCGCCTA CGtSCCCTCGT CTGTTCGGCC 


101 


rCAGGGCGCG 
AGTCCCGCQC 


TCAGCGGGTG 
AGTCCX:CCAC 


TTGGCGGGTG TCGGGGCTGG CTTAACTATG 
AACCGCCCAC ACCCCCGACC CAATTGATAC 








NdeI


151


CGCCATCA5X 
GCCGTAGTCT 


CCAGATTCTA 
CGTCTAACAT 


CTGAGACTGC ACCATATGCO GTGTGAAATA 
GACTCTCACG TGCTATACGC CACACTTTAX 








BalI


201 


CCGCACAGAT 
GCCGTGTCTA 


GCGTAAGCAG 
CGCATTCCTC 


AAAATACCGC ATCAGATTGG CTATTCGCCA 
TTTTATGGCG TAGTCTAACC GATAACCGGT 




TTGCATACGT 
AACGTATGCA 


TGTATCCXTA TCATAATATG TACATTTATA rTTGGCTCATG 
ACATAGGTAT AGTATTATAC ATGTAAATAT AACCGAGTAC 


301 


CCCAACATTA 
AGGTTGTAAr 


CCGCCATGTX 
GGCGGTACAA 


GACATOGATT ATTGACTAGT TATTAATACT 
CTGTAACTAA TAACTGATCA ATAATTATCA 


351


AATCAArXAC 
TTAGTTAATG 


GGGGTCATTA 
CCCCAGTAAT 


GTTCATAGCC CATATATGGA GTTCCGCGTT 
CAACTA7CGG G7ATATACCT CAAGGCGCAA 


401 


ACATAACTTA CGGTAAATGG CCCGCCTGGC TGACCGCCCA ACGACCCCCG 
■rC^ATTGAAT GCCATTTACC GGGCGGACCG ACTGGCGGGT 1'CCTGGGGGC 


451 


CCCATTGACG 
GGGXAACTGC 


TCAATAATGA 
AGTTATTACT 


CGTATGTTCC CATAGTAACG CCAATAGGGA 
GCATACAAGG GTATCATTGC GGTTATCCCT 


501 


CTTTCCATTC 
GAAACCTAAC 


ACGTCAATGG 
TOCAOTTACC 


GTGGAGTA1V TACGGTAAAC TGCCCACTTC 
CACCTCATAA ATGCCATTTG ACGGGTGAAC 






NdeI


551 


GCAGTACATC 
CGTCATGTAG 


AAGTGTATCA TATGCCAftCT ACGCCCCCTA TTGACGTCAA 
TTCACATACT ATACGGTTCA TGCGGGGGAT AACTGCAGTT 


601


TGACGGTAAA 
ACTGCCATTT 


TGGCCCCCCT 
ACCGGGCGGA 


GGCATTATGC CCAGTACATG ACCTTATGCG 
CCGTAATACG GGTCATGTAC TGGAATACCC 








NcoI




ACTTTCCTAC 
TGAAACCATG 


TTGGCAGTAC 
AACCCTCATG 


ATCTACGTAT TAGTCATCGC TATTACCATG 
TAGATGCATA ATCAGTAGCG ATAATGGTAC 




NcoI






701 


GTGATGCGGT TTTGGCAGTA 
CACTACGCCA AAACCGTCAT 


CATCAATGGG CGTGGATACC GGrTTCACTC 
GTAGTTACCC GCACCTATCG CCAAACTGAG 



751 ACGGGGATTT CCAAGTCTCC ACCCCATTGA CGTCAATGGG AGTTTCTTTT 
TGCCCCTAAA GGTTCAGACG TGGGGTAACT GCAGTTACCC TCAAACAAAA 




WO 99/32147 PCT/US98/27364



801 GGCACCAAA*. TCJU^CGGCAC TTTCCAAAAT GTCGTAACAA CTCCGCCCCA 
CCGTGGTTTT AGTTGCCCTG AAAGGTTTTA CAGCATTGTT GAGGCCGGGT 



851 TTGACGCAAA TCOOCGGTAG CCGTGTACCC TGGGAGGTCT ATATAAGCAG 
AACTGCGTTT ACCCGCCATC CGCACATGCC ACCC5CCAGA TATATTCGTC 



901 AGCTCCTTTA CTOAACCGTC AGATCGCCTG GAGACGCCAT CCACGCTGTT 
TCGAGCAAAT CACTTGCCAG TCTAGCGGAC CTCTCCGGTA GGTGCGACAA 



SacII



951 TTGACCTCCA TAOAAQACAC CG3GACCCAT CCA6CCTCCO CGGCCGCCAA 
AACTG6AGGT ATCTTCTGTG GCCCTGGCTA GGTCGCAGGC GCCGGCCCTT 



1001 CGGTGCATTG GAACGCGGAT TCCCCGTGCC AAGAGTGACG TAAGTACCGC 
GCCACGTAAC CTTGCGCCTA AGGGGCACCG TTCTCACTGC ATTCATGGCQ 



SphI 



1051 CTMTAGACTC TA-TAGGCACA CCCCTTTGGC TCTTATGCAT GCTATACTGT 
CATATCTGAG ATATCCGTGT GGGCAAACCG AGAATACGTA CGATATGACA 



1101 TTTCGGCTTG GGGCCTATAC ACCCCCCCTT CCTTATCCTA TAGGTGATGG 
AAAACCGAAC CCCGGATATG TGGGGCCGAA GGAATACGAT ATCCACTACC 



1151 TATAGCTTAG CCTATAGGTG TGGGTTATT6 ACCATTATTG ACCACTCCCC 

atatcgaatc ggatatccac acccaataac tggtaataac tggtgagggg 



1201 tattcgtgac gatactttcc attactaatc gataacrc:gc ctctttgcca 
ataaccactg ctatgaaagg taatgattag gtattctacc gacaaacggt 



1251 CAACTATC7C TATTGGCTAT ATGCCAATAC TCTGTCCTTC AGA6ACTGAC 
GTTGA7AGAG ATAACCCATA TACGGTTATG AGACAGGAAG TCTCTGACTG 



1301 ACGGACTCTG TATTTTTACA GGATGGGGTC CCATTTATTA TTTACAAATT 
TGCCTGAGAC ATAAAAATGT CCTACCCCAG GCTAAATAAT AAATGTTTAA 



1351 CACATATACA ACAACGCCGT CCCCCCTOCC CGCACTTT7T ATTAAACATA
GTGTATATGT TGTTCCGGCA GGGGGCACGG CCGTCAAAAA TAATTTGTAT 



1401 GCGTGGGATC TCCACGCGAA TCTCGGGTAC CTGTTCCGGA CATGGGCTCT 
CGCACCCTAG AGGTGCGCTT AGAGCCCATC CACAAGGCCT GTACCCCAGA 



1451 TCTCCGGTAG CGGCGGAGCT TCCACATCCC AGCCCTGGTC CCATGCCTCC 
AGACGCCATC GCCGCCTCGA AGGTGTACGC TC6GGACCAG GGTACGGAGG 



1501 ACCGGCTCAT GGTCGCTCGG CAGCTCCTTG CTCCTAACAG TGGAGGCCAG 
TCGCCGAGTA CCAGCGAGCC CTCGAGGAAC GAGGATTGTC ACCTCCGGTC 



1551 ACOTAGGCAC AGCACAATGC CCACCACCAC CAGTGTGCCG CACAAGGCCG 
TGAATCCGTG TCGTGTTACG GCTGGTGGTC GTCACACGGC GTGTTCCCGC 



1601 TGGCGGTAGG GTATGTGTCT CAAAATGAGC GTGGAGATTG GGCTCGCACG 
ACCGCCATCC CATACACACA CTTTTACTCG CACCTCTAAC CCGAGCGTGC 



1651 GCTGACGCAG ATGGAAGACT TAAGGCAGCG GCAGAAGAAO ATCCAGGCAG 
CGACTGCGTC TACCTTCTGA ATTCCGTCGC CGTCTTCTTC TACGTCCCTC 






BclI EcoRV



2451 TTXTTCCTC TATGACACCC TGGC7TCARC TGTAATTTAC ACAGGAGTCA 
AAAGAAGCAG ATACTGTCCG ACCGAAGTTG ACATTAAATG TCTCCTCAGT 
2501


TAAAACGACT 


GGGGOTAXTC GCAITCTTGA 
CCCCCATTAG CCTAAQJ^ACT 


TATTGGCTAA 
ATAACCCATT 


ACCAAAGGAA 
TGGTTTCCTT 


2551 


ACGTTCCTTC 
TCCAAGGAA.G 


AATCACCCCC CATTCGAGAG 
TTACTGGGGO CTAAGCTCIC 


GCAGCAAACT 
CGTCGTTTGA 


ACAC7GAAAA 
TGTGACTTTT 


2601


TACATCAAGT TACTATGCCA CATCCTACTT GGAGTACGAA ATCGAAAATT 
ATGTAGTTCA ATGATACCGT GTAGGATGAA CCTCATGCTT TAGCTTTTAA 


2651 


TTGGTGCTCA ACACTCCACS ACCCTTTTCA AAATTAACAA 
AACCACCAGT TGTGAGGTCC TGG6AAAAGT TTrAATTGTT 


TAATACTTTT 
ATTAXGAAAA 


2701 


C77CTTCTCS 
CAAGAAGACC 


ACAGCCCCCA CACGCCTCAG 
TGTCCGGCGT GTGCCCAGTC 


TTCCTTTTCC 
AAGGAAAAGG 


AGCTGAATGA 
TCGACTTACT 


2751 


TACCATTCAA CTTCACCAAC AGTTCAGCAA CACAACTGGG AAACTAATTT 
ATCGTAAGTT GAAGTGGTTG TCAACTCGTT GTGTTGACCC TTTGATTAAA 


2801 


GGACACTAGA 
CCTGTGATCT 


^rOCTAATATC AATGCTGATA 
ACGATTATAG TTACGACTAT 


TTCGTGAATC 
AACCACTTAC 


GGCTTTTTGG 
CCGAAAAACC 


2851 


GAJ^TAAAA 
CTTTTATTTT 


AAATCTCTCC GAACAACTAC 
TTTAGAGAGG CTTGTTGATG 


GTGGAGAAGA 
CACCTCTTCT 


GCTGTCTTTC 
CGACAGAAA6 


2901


GAAACrCTAT 
CTTTCAJIATA 


CGCTCAACGA GACAGAAGAC 
GCGAGTTGCT CTGTCTTCTG 


CATGATGCGA 
CTACTACGCT 


CATCGTCGAG 
GTAGCAGCTC 


2951 


AACTACAAAG 
TTGATGTTTC 


GGAAGAATCT CCGACCGGGC 
CCTTCTTACA GGCTGGCCCG 


CACCAGCAAG 
G'TGGTCCTTC 


TATTCGGACC 
ATAAGCCTGG 


3001 


rrCGTTCCAAA 
ACCAACGTTT 


GGATTCCCCT GGGATGGTTT 
CCTAAGGGGA CCCTACCAAA 


CATTGCACGT 
GTAACGTGCA 


ACCAGAAGGG 
TCGTCTTCCC 


3051 


ca;u^caacat 

CTTTCTTGTA 


TGCCGTCTCA GAATTCGACA 
ACCGCAQAGT CTTAAGCTGT 


GAAGGTCCAA 
CTTCCAGCTT 


GAGTAGATGT 
CTCATCTACA 


3101 


GAATACTCAG 
CTTATGAGTC 


GAAACTATCA CAGAGACAAC 
CTTCGATACT GTCTCTGTTG 


TGCAACAATC 
ACGTTGrrTAG 


ATAGGCACTA 
TATCCGTCAT 


3151 


ACGGTAACAA 
TGCCATTGTT 


CATGCAGATC TCCACCATCG 
GTACGTCTAG AGGTGCTAGC 


GGACAGGACT 
CCTGTCCTGA 


GAGCTCCAGC 
CTCGAGCTCG 


3201 


CAAATCCTGA 
GTTTAGGACT 


NcoI

GTTCCTCACC GACCATCGCA 
CAAGGACTGG CTCGTACCGT 


CCAAGCCCTG 
GGTTCGGCAC 


AGACTCAGAC 
TCTGAGTCTG 


3251 


CTCCACAACC 
GAGCTGTTGG 


TACACACCAA AACTACCAGT GATGACCACC 
ATGTGTGCTT TTGATGGTCA CTACTGGTCC 


CAGGAACCAA 
CTCCXXGGTT 



3301 CAACACCACC GAGAAACTCT CCTGGCTCAA CAACACAAGC ACCCACTCTC 
GrTGTGGTGG CTCTTTGACA GGACCGAGTT GTCGTCTTCC TGGGTGACAQ 



3351 ACC7VCCCCAG AGAATATAAC AACAGCGGTT AAAACTGTrT GGGCACAAGA 
TCGTGGGGTC TCTTATATTG TTGTCGCCAA TTTTGACAAA CCCCTGTTCT 
3401


GTCCRCAAGC AACGGTCTAA TAACTTCAAC AGTAACACGT XTTCTTGGCA. 
CJiaGTGTTCC TTGCCACA.TT ATTCAAGTTG TCATTGTCCA TAACAACCCT 




3451 


CCCTTGCAC? 7COAAAACGC AGCACAAGAC AAGTTAACfcC CAGCGCCACG 
CGGAACCTGA AGCTTTTGCG TCCTCTTCTG TTCAATTGTG GTCCCGGTGC 




3501 


GGTAAATGCA ATCCCAACTT ACACTACTGG ACTGCACAAG AACAACATAA 
CCATTTACGT TAGGGTTGAA TGTGATGACC TGACGTGTTC rrGTTGTATT 




3551 


BamHI

tgctgctcgg attgcctgga tcccgtactt tggaccgggt gcagaaggca 
acgacgaccc taaccgacct agggcatgaa acctggccca cgtcttccgt 




3601 


TATACACTGA AGGCCTTATG CACAACCAAA ATCCCTTAGT CTCTGGACTC 
ATATG7CACT TCCGGAATAC GXGTTGGTTT TACGGAATCA GACACCTGAG 




3651 


AGACAACTTG CAAATGAAAC AACTCaAGCT CTCCAGCTTT TCTTAAGGGC 
TCTGTTCAAC GTrTACTTTG rTTGAGTTCGA GACGTCGAAA AGAAT'.'CCCG 




3701 


CACGACGCAG CTGCCGACAT ATACCATACT CAATAGGAAG GCCATAGATT 
GTGCTCCCTC GACGCCTGTA TATCGTATGA GTTATCCTTC CGGTA-rCTAA 






BamHI




3751 


TCCTTCTGCG ACGATGGCGC GGGACATGTA GGATCCTGGG ACCAGATTGT 
ACGAAGACCC TGCTACCCCG CCCTCTACAT CCTAGGACCC TGGTCTAACA 




3801 


CCCATTGAGC CACATCATTG GACCAAAAAC ATCACTGATA AAATCAACCA 
ACGTAACTCG GTGTACTAAC CTGGTTTTTG TAGTGACTAT TTTAGTTGG'? 




3851 


AATCATCCAT GATTTCATCG ACAACCCTTT ACCCAATCAG GATAATGATC 
TTAGTAGGTA CTAAAGTAGC TGTTGGGAAA TGGGTTAGTC CTATTACTAC 






BamHI




3901 


ATAATTGGTG GACGGGCTGG AGACAGTGGA TCCCTGCAGG AATAGGCATT 
TATTAACCAC CTGCCCGACC TCTGTCACCT AGGGACGTCC TTATCCGTAA 




3951 


ACTGGAATTA TTATTCCAAT CATTGCTCTT CTTTGCGTCT GCAAGCTGCT 
TGACCTTAAT AATAACGTTA GTAACGAGAA GAAACGCAGA CGTTCGACGA 






BamHI




4001


■rrGTTGAATA TCAGAATTCC AGCACTGCCG GCCGTTACTA GTGGATCCGA 
AACAACTTAT AGTCTTAAGG TCGTGACCGC CGGCAATGAT CACCTAGGCT 






NarI






BamHI XbaI KasI BamHI




4051 


CCTCGGATCC AAGCTCTAGA CCAGGCGCCT GGATCCAGAT CTGCTGTGCC 
CGAGCCTAGG n'CCAGATCT GCTCCGCGGA CCTAGGTCTA GACGACACGG 




4101 


TTCTAGT7GC CAGCCATCTG TTGTTTGCCC CTCCCCCGTC CCTTCCTTGA 
AAGATCAACG GTCGGTAGAC AACAAACGGG GAGGGCGCAC GGAAGGAACT 





4151 CCCTGGAAGG TGCCACTCCC ACTGTCCTTT CCTAATAAAA TGAGCAAATT 
GGCACCTTCC ACGGTGAGGG TGACAGGAAA GGATTATPTT ACTCCTTTAA 






4201 GCATCGCATT CTCTCAGTAO GTGTCATTCT ATTCTGGGGG GTGGGGTGGC 
CCTAGCGTAA CAGACTCATC CACAGTAAGA TAAGACCCCC CACCCCACCC 



SphI 



4251 CCAGCACAGC AAGGGGGACSG ATTGGGAAGA CAATAGCAGG CATGCTGGGG
CGTCGTGTCG TTCCCCCTCC TAACCCTTCT OTTATCGTCC GTACGACCCC 



KpnI



4301 ATGCCGTGGG CTCTATGGGT ACCCAGGTGC TCAAGAATTC ACCCGGTTCC
TACGCCACCC GAGATACCCA TGGGTCCACG ACTTCTTAAC TGGCCCAAGG 



4351 TCCTGGGCCA GAAAGAAGCA GGCACATCCC CTTCTCTGTG ACACACCCTG 
ACGACCCGGT CTTTCTTCGT CCGTGTAGCC GAAGACACAC TGTGTGGGAC 



4401 TCCACGCCCC TGGTTCTTAG TTCCAGCCCC ACTCATACGA CACTCATAGC 
AGGTGCGGCG ACCAACAATC AAGGTCGGCG TGAGTATCCT GTGAGTATCG 



4451 TCAGGAGGCC TCCGCCTTCA ATCCCACCCG CTAAAGTACT TGGAGCGGTC 
AGTCCrCCCG AGGCGGAAGT TAGGGTCCGC GATTTCATGA ACCTCGCCAG 



4501 TCTCCCTCCC TCATCAGCCC ACCAAACCAA ACCTAGCCTC CAAGAGTGGG
AGAOGGAGGG AGTAGTCCGG TGGTTTGGTT CGGATCGGAG GTTCTCACCC 



4551 AAGAAAmAA AGCAAGATAG GCTATTAAGT GCAGAGGGAG AGAAAATGCC 
TTCTTTAATT TCGXTCTATC CGATAATTCA CGTCTCCCTC TCTTTTACGG 



XmnI



4601 TCCAACAICT OAGGAAGTAA TCAGAGAAAT CATAGAATTT CTTCCGCTTC
AG6TTGTACA CTCCTTCATT ACTCTCTTTA GTATCTTAAA GAAGGCGAAG 



4651 CTCGCTCACT GACTCGCTGC GCTCGGTCCT TCGGCTCCGG CGACCGGTAT 
GAGCGAGTGA CTGAGCGACC CGAGCCAGCA AGCCGACGCC CCTCGCCATA 



4701 CAGC7CACTC AAAGGCGGTA ATACGGTTAT CCACAGAATC ACGGGATAAC 
GTCGAGTGAG TTTCCGCCAT TATGCCAATA GGTGTCTTAG rCCCCTATTO 



4751 GCAGGAAAGA ACATGTGAGC AAAAGCCCAG CAAAAGGCCA GGAACCGTAA 
CGTCCTTTCT TGTACACTCG TTTTCCGGTC GTCTTCCGGT CCTTGGCATT 



4801 AAAGGCCGCG TTGCTGGCGT TTTTCCATA6 GCTCCGCCCC CCTC3ACGAGC
TETCCGGCGC AACGACCGCA AAAAGGTATC CGAGGCGCGG GGACTGCTCG 



4851 ATCACAAAAA TCGACGCTCA ACTCAGACCT GGCGAAACCC GACACOACTA
TAGTGTTOTT AGCTGCCAGT TCAGTCTCCA CCGCTTTGCC CTGTCCTGAT 



4901 TAAAGATACC AGCCGTTTCC CCCTGGAAGC TCCCTCGTGC GCTCTCCTGT
ATTTCTATCG TCCGCAAAGG GGCACCTTCC AGOGAGCACC CGACAGGACA 



4951 TCCGACCCTG CCCCTTACCC GATACCTGTC CGCCTTTCTC CCTTCGGGAA 
AGGCTGGGAC GGCGAATCGC CTATGGACAG GCCGAAAGAC GGAAGCCCTT 



5001 GCGTGGCGCT TTCTCAATGC TCACGCTGTA GGTATCTCAG TTCGGTGTAC
CGCACCGCGA AAGAGTTACQ AGTGCGACAT CCATAGACTC AAGCCACATC 
5301 CCTTCGGAAA AW»GTTGGt AGCTCTTGAT CCGGCAAACA AACCACCGCT
GGJlAGCCrr? TTCTCAACCA TCGAGAACTA GGCCCTTTGT TTCCTGGCGA 



5451 GCAACOWlAX CTCACGTTAA GGGATTTTGG TCATCAGATT ATCAAAAAGG 
CCTTGC7TTT GAGTGCAATr CCCTAAAACC AGTACTCTAA TAGTTTTTCC 



5551 AAQTATATAT GAGTAAACTT GCTCTGACAG TTACCAATGC TTAATCAGTG
TTCATATA7A CTCATTTGAA CCAGACTCTC AATGGTTACG AATTAGTCAC 



5601 AGGCACCTAT CTCAGCGATC TGTCTATTTC GTTCATCCAT AGTTCCCTGA 
TCCCTGQATA CAGTCGCTAG ACAGATAAAG CAAGTAGGTA TCAACGGACT 



5651 CTCCGGGGGG CGGGGGCCCT GAGGTCTGCC TCGTGAAGAA GCTGTTGCTG
CAGGCCCCCC CCCCCCGCGA CTCCAGACGG AGCACTTCTT CCACAACCAC 



5951 GATTACAAAA ACTCATCGAG CATCAAATGA AACTGCAATT TATTCATATC 
CTAATCTTTT TGAGTAGCTC GTAGTTTACT TTGACGTTAA ATAAGTATAG 
6451 TGCTGTTTTC CCGGGGATCC CAGTGGTGAG TAACCATGCA TCATCAGGAG
ACGACAAAAG GGCCCCTAGC GTCACCACTC A7TGGTACGT AGTAGTCCTC 



6801 ACAGTTTTAT TGTTCATGAT GATATATTTT TATCTTGTGC AATGTAACAT 
TGTCAAAATA ACAAGTACTA CTATATAAAA ATAGAACACG TTACATTGTA 
DraIII



6851


CAGAGATTTT 
CTCTCTAAAA 


GAGACACAAC 


GTGGCTTTCC 
CACCGAAAGG 


CCCCCCCCCC 
GGGGGG6GGG 


ATTATTGAAC 
TAAPAACTTC 


6901


CATTTATCAC 
GTAAATACTC 


GGTTATTGTC 
CCAATAACAC 


TCATGAGCGG 
AGTACTCGCC 


ATACATATTT 
TATGTATAAA 


GAATGTATTT 
CTTACATAAA 


6951 


AGAAAAATAA 
TCTTTTTATT 


ACAAATAGGG 
TGTTTATCCC 


CTTCCGCCCA 
CAAGGCCCGT 


CATPTCCCCG 
GTAAAGG6GC 


AAAAGT6CCA 
TTTTCACCGT 


7001 


CCrCACQTCT AAGAAACCAT TATTATCATC ACATTAACCT ATAAAAATAG 
CWACTGCACak TTCTTTQGTA ATAATAGTAC TGTAATTGOA TATTTTTATC 


7051


GCGTATCACG 
CGCATAGTGC 


AGGCCCTTTC 
TCCGGGAAAG 


CTC 
CAG 
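Each row of the listing prints the upper strand with its complementary lower strand directly beneath it, so the two rows can be cross-checked base by base to locate print or scanning errors. The following is a minimal Python sketch, assuming (the source does not state this) that the lower row is written in register with the upper row and is therefore compared as a plain complement, not a reverse complement. The helper name `strand_mismatches` is hypothetical.

```python
# Watson-Crick pairing used for the comparison.
PAIRS = {"A": "T", "T": "A", "C": "G", "G": "C"}


def strand_mismatches(top, bottom):
    """Return 1-based positions where `bottom` is not the complement of `top`.

    Spaces between the 10-base blocks are ignored. Any character that is
    not A, C, G or T in the upper row is reported as a mismatch.
    """
    top = top.replace(" ", "").upper()
    bottom = bottom.replace(" ", "").upper()
    if len(top) != len(bottom):
        raise ValueError("rows must contain the same number of bases")
    return [
        i + 1
        for i, (t, b) in enumerate(zip(top, bottom))
        if PAIRS.get(t) != b
    ]


# Two 10-base blocks taken from the row labelled 3951 of this listing.
print(strand_mismatches("ACTGGAATTA", "TGACCTTAAT"))  # -> []
print(strand_mismatches("TTATTCCAAT", "AATAACGTTA"))  # -> [6]
```

A reported position only shows that the two rows disagree there; it does not indicate which of the two rows carries the error.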
General Description

DNA pVR 1012-GP(Z)
Local Object

Created: 09/15/98 05:06PM

Last Modification Date: ? (no data)

Length: 7285 bp

Storage type: Basic

Form: Circular



Restriction Map
DraIII: 1 site
HindIII: 1 site
HpaI: 1 site
KasI: 1 site
NarI: 1 site
NotI: 1 site
PmlI: 1 site
PvuI: 1 site
SacII: 1 site
XbaI: 1 site
XhoI: 1 site

EcoRV: 2 sites

NcoI: 2 sites

NdeI: 2 sites

SphI: 2 sites
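The site counts in a restriction map of this kind can be reproduced from a sequence by pattern matching. The sketch below is a hedged illustration in Python, not the software that produced the map. It assumes the plasmid is circular (as stated in the General Description), treats `N` in a recognition sequence as any base, and searches both strands. The function names are hypothetical, and the recognition sequences used in the example (BamHI `GGATCC`, XmnI `GAANNNNTTC`) are the standard published ones, not values read from this listing.

```python
import re

COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Reverse complement of a DNA string (N stays N)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def site_positions(seq, site, circular=True):
    """Return sorted 1-based start positions of `site` in `seq`.

    Both strands are searched. On a circular molecule a site may span
    the origin, so the first len(site) - 1 bases are appended before
    searching.
    """
    seq = seq.upper()
    site = site.upper()
    search = seq + seq[: len(site) - 1] if circular else seq
    hits = set()
    # A palindromic site equals its own reverse complement, so the set
    # holds one motif and each occurrence is counted once.
    for motif in {site, reverse_complement(site)}:
        pattern = "(?=" + motif.replace("N", "[ACGT]") + ")"
        hits.update(m.start() for m in re.finditer(pattern, search))
    return sorted(p + 1 for p in hits)


# Toy sequences (not taken from the listing).
print(site_positions("TCCAAAAGGA", "GGATCC"))                  # -> [8]
print(site_positions("TCCAAAAGGA", "GGATCC", circular=False))  # -> []
print(site_positions("TTGAAACGTTTCTT", "GAANNNNTTC"))          # -> [3]
```

The first toy example shows why circularity matters: the BamHI site is only found when the end of the sequence is joined to its start.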

Functional Map
CDS (4 signals)
CMV IE 5' UT

Start: 886 End: 1129
CMV IE INT

Start: 1130 End: 1940
TbGH

Start: 4302 End: 4854

Kan r

Start: 6350 End: 6972 (Complementary)
Misc_feature (2 signals)
CMV enhancer

Start: 248 End:
GP(Z)

Start: 1870 End:
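The coordinates in the two functional maps can be cross-checked with simple arithmetic. The sketch below, in Python, uses only values that are legible in both maps (the plasmid lengths and the TbGH and Kan r coordinates) and assumes that Start and End are inclusive 1-based positions, which the source does not state. The dictionary layout and the function name are hypothetical.

```python
def feature_length(start, end):
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


# Values as printed in the two maps.
gp_s = {"length": 7073, "TbGH": (4090, 4642), "Kan r": (6138, 6760)}
gp_z = {"length": 7285, "TbGH": (4302, 4854), "Kan r": (6350, 6972)}

# Difference in total plasmid size between the two constructs.
size_difference = gp_z["length"] - gp_s["length"]

for name in ("TbGH", "Kan r"):
    shift = gp_z[name][0] - gp_s[name][0]
    print(
        name,
        feature_length(*gp_s[name]),
        feature_length(*gp_z[name]),
        shift == size_difference,
    )
```

Under that assumption, each feature downstream of the insert has the same length in both plasmids and is shifted by the difference in plasmid length, which is what one would expect if the two constructs differ only in the inserted GP sequence.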

Annotations 
1 TCGCCCGTTT CCaTGATGAC GGTG1UU«ACC TCTC*CACAT GCAGCTCCCG
ACCGCGCAAA. GCCACTACTG CCACTTTTGG AGACTGTGTA COTCOAOCGC 



51 


GAGACGGTCA. 
CTCTGCCAGT 


CAGCTTCyrCT 
GTCGAACAGA 


GTAACCGGAT 
CATTCGCCTA 


CCCGGGAGCA GACAAGCCCG 
CGGCCCTCCT CTGTTCGGGC 


101 


TCAGGGCGCG TCAGCCGGTG 
AGTCCCGCGC AGTCGCCCAC 


TTCGCGGGTG 
AACCGCCCAC 


TCGGGGCTCG 
AGCCCCCACC 


CTTAACTATQ 
GAATTGATAC 










NdeI




151 


CGGCXTCAGA. 
CCCGTAOTCT 


GCACArrGTA 
CGTCTAACAT 


CTGACAGTGC 
GACTCTCACG 


ACCATATGCC 
TGGTATACGC 


GTGTGAAATA 
CACACTTTAT 


201 


CCGCXCAGXT 
GGCGTGTCTA 


GCGTAAGGAG 
CCCATTCCTC 


AAAATACCGC 
TTTTATGGCG 


ATCAGATTGO 
TAGTCTAACC 


CTATTCGCCA 
GATAACCGGT 


251 


TTGCATACGT TGTATCCATA 
AACGTATGCA ACATAGGTAT 


TCATAATATC 
AGTATTATAC 


TACATTTATA 
ATGTAAATAT 


TTGGCTCATG 
AACCGAGTAC 


301 


TCCAACATTA 
AGGTTGTAAT 


CCGCCATGTT 
GGCGGTACAA 


CACATTGAT7 
CTGTAACTAA 


ATTGACTAGT TATTAATAGT 
TAACTGATCA ATAA'TTATCA 


351 


AATCAATTAC 
TTAGTTAATG 


GGCGTCATTA 
CCCCAGTAAT 


GTTCATAGCC 
CAAGTA7CGG 


CATATATGGA 
GTATATACCT 


GTTCCGCGTT 
CAAGCCGCAA 


401 


ACATAACTTA 
TGTATTGAAT 


CGGTAAATGG 
CCCATTTACC 


CCCGCCTGGC 
GGGCGGACCG 


TCACCGCCCA ACGACCCCCG 
ACTGGCCCGT TGCTGGGGGC 


451


CCCATTCACG 
G6GTAACTGC 


TCAATAATCA 
AGTTATTACT 


CGTATGTTCC 
GCATACAAGC 


CATAGTAACG 
GTATCAT^GC 


CCAATAGGGA 
GCTTATCCCT 


501 


C7TTCCATTG 
GAAAGOTAAC 


ACGTCAATGG 
TGCAGTTACC 


GTCGAGTATT 
CACCTCATAA 


TACGGTAAAC 
ATCCCATTT3 


7GCCCACTTG 
ACGGGTGAAC 












551 


GCAGTACATC 
CGTCATQTAG 


AAGTGTATCA TATGCCAAGT 
TTCACATAGT ATACGGTrGA 


ACGCCCCCTA 
TGCGGCGGAT 


TTGACGTCAA 
AACTGCAGTT 


601 


TGACCGTAAA 
ACTGCCATTT? 


TGGCCCCCCT 
ACCGCGCGGA 


GCCATTATGC 
CCGTAATACG 


CCAGTACATG 
GGTCATGTAC 


ACCTTATGGG 
TGGAATACCC 












NcoI


651


ACTTTCCTAC 
TGAAACGATG 


TTGCCAGTAC 
AACCGTCATG 


ATCTACGTAT TAGTCATCGC TATTACCATG 
TAGATGCATA ATCAGTAGCC ATAATGGTAC 




NcoI










701 


GTGATGCCGT 
CACTACGCCA 


TTTGGCAGTA 
AAACCOTCAT 


CATCAATGGG 
GTAATTACCC 


CGTGGATAGC 
CCACCTATCG 


CGTTTGACTC 
CCAAACTGAG 



751 ACGCGGATTT CCAAGTCTCC ACCCCATTGA CGTCAATCGG AGTTTGTTTT 
TCCCCCTAAA GGTXGAGAGG TCGGGTAACT GCAGTTACCC TCAAACAAAA 



801 GGCACCAAAA TCAACGGGAC TTTCCAAAAT GTCGTAACAA CTCCGCCCCA
CCGTCGTTTT ACTTGCCCTC AAAGGTTTTA CAGCATTGTT GAGGCGGGCT 
851


TTG&CGCA*A 
AACTCCGTTT 


TOGGCCGrrAG 
ACCCGCCATC 


GCCTGTACGG 
CGCACATCCC 


TCGGAGGTCT ATATAAGCAC 
ACCCTCCAGA TATATTCXJTC 


901 


A.GCTCGTTTA 


GTCAACCGTC 
CACTTGGCAG 


AGATCGCCTO 
TCTAGCCGAC 


OAGACCCCAT 
CTCTGCGGTA 


CCACCCTGTT 
GCTGCGACAA 


951 


TTGACCTCCA TAGAAXIACAC 
AACTGGAGGT ATCTTCTGTG 


SacII

CGGGACCGAT CCAGCCTCCG CGGCCGGGAA 
GCCCTGGCTA GGTCGGAGGC GCCGGCCCTT 


1001


CCGTGCATTG 
GOCACCTAAC 


GAACGCGGAT 
CTTGCGCCTA 


TCCCCGTGCC AAGAGTGACG 
AGGGGCACGG TTCTCACTGC 


TAAGTACCGC 
ATTCATGGCG 


1051 


CTATAGACTC 


TATAGGCACA 
ATATCCGTGT 


ccccrrrccc 

GGCGAAACCC 


SphI

TCTTATGCAT 
AGAATACGTA 


C 

GCTATACTGT 
CQATATGACA 


1101 


TTTTCGCTTG 
AAhACCGAAC 


GGGCCTATAC ACCCCCGCTT CCTTATGCTA TAGGTGATGG 
CCCGGATATG TGGGCGCGAA CGAATACGAT ATCCACTACC 


1151 


TATAGCTTAG 
ATATCGAATC 


CCTATAGOTQ TOGGTTATTG ACCATTATTC ACCACTCCCC 
GGATATCCAC ACCCAATAAC TGGTAATAAC TGGTGACGGG 


1201 


TATTGGTGAC GATACTTTCC 
ATAACCACTG CTATGAAAGG 


AITACTAATC 
TAATGATTAG 


CATAACATGG 
GTATTGTACC 


CTCTTTGCCA 
GAGAAACGGT 


1251


CAACTATCTC TATTGGCTAT 
GTTGATAGAG ATAACCGATA 


ATGCCAATAC 
TACGGTTATG 


TCTGTCCTTC 
AGACAGOAAG 


AGAOACTGAC 
TCTC7GACTG 


1301 


ACGGACTCTC TATTTTTACA 
TCCCTGAGAC ATAAAAATGT 


GGATGGGGTC 
CCTACCCCAG 


CCATTTATTA 
GG7AAATAAT 


TTTACAAATr 
AAATGTTTAA 


1351 


CACATATACA 
GTCTATATGT 


ACAACGCCCT 
TGTTGCGGCA 


CCCCCCTGCC 
GGGGGCACGG 


CGCAGTTTTT 
GCG7CAAAAA 


ATTAAACATA 
TAATTTGTAT 


1401 


GCGTGGCSATC 
CGCACCCTAG 


TCCACGCGAA 
AGGTGCGCTT 


TCTCGGGTAC 
AGAGCCCATG 


GTGTTCCGGA 
CACAAGGCCT 


CATGGGCTCT 
GTACCCGAGA 


1451 


CCTCCGGTAG 
AQAGGCCATC 


CGGCGGAGCT 
GCCGCCtCGA 


TCCACATCCG 
AGGTGTAGGC 


AGCCCTGGTC 
TCGGGACCAG 


CCATGCCTCC 
GGTACaOAGG 


1501 


AGCGCCTCAT 
TCCCCGAGTA 


CC5CGCTCGG 
CCAGCCAGCC 


CAGCTCCTTG 
CTCGAGGAAC 


CTCCTAACAC 
GAGGATTGTC 


TGGAGGCCA6 
ACCTCCGGTC 


1551 


ACTTAGGCAC 
TGAATCCGTG 


AGCACAATGC 
TCGTCTTACG 


CCACCACCAC 
CGTGGTGGTC 


CACTGTGCCG 
GTCACACCGC 


CACAAGCCCG 


1601 


TGGCGGTAGG 
ACCCCCATCC 


GTATGTGTCT GAAAATGAGC GTGGAOATTG 
CATACACAGA CTTTTACTC6 CACCTCTAAC 


GGCTCGCACG 
CCGAGCGTGC 



1651 GCTGACGCAG ATGGAAGACT TAAGGCAGCG GCAGAAGAAC ATGCAGCCAG
OSACTGCCTC TACCTTCTGA ATTCCGTCCC CGTCTTCTTC TACCTCCGTC 



1701 CTGAGTTCCT GTATTCTOAT AACAGTCAGA GGTAACTCCC GTTGCGQTGC 
GACTCAACAA CATAAGACTA TTCTCACTCT CCATTGAGGG CAACGCCACC 
1751 TGTTAACGGT GGACCGCAGT GTAGTCTGAG Ca.CTACTCGT TGCTCCCGCG
ACAATTGCCA CCTCCCGTCX CATCAGACTC GTCATGAGCA ACGACGGCGC 



1801 CGCSCCACCA GACATAATAG CVGACAGACT AACW»CTGT TCCTTTCCAT 
GCCCGGTGGT CTGTATTATC GACTGTCTGA TTGTCTCACA AGGAAACtSTA 



1851 GGGTCrrrxC TGCACTCRCC GTCCTCGACA CGTGTGATCA CATATCGCGG 
CCCAGAAAAQ ACGTC/lGTGG CAGCAGCTGT CCACACTACT CTATAGCGCC 



1901 CCGCTCTAGA CCAGGCGCCT GGATCGATCC GCGATGAAGA TTAAGCCGAC 
GGCGAGATCT GCTCCGCCCA. CCTAGCTAGG CGCTACTTCT AATTCGGCTG 



1951 AGTCAGCGTA ATCTTCATCT CTCTTACATT ATTTGTTTTC CACAGTAGGG 
TCACTCGCAT TAGAAGTAGA GA6AATCTAA TAAACAAAAG GTCTCATCCC 



2001 GTCGTCAGGT CCTTTTCAAT CGTGTAACCA AAATAAACTC CACTAGAAGG 
CAGCAGTCCA GGAAAAGTTA GGACATTGGT TTTATTTGAG GTCATCTTCC 



2051 ATATTGTGGG GCAACAACAC AATGGGCGTT ACACGAATAT tCCAGTrACC 
TXTAACACCC CGTTGTTGTG T^TACCCGCAA TGTCCTTATA ACGTCAATGG 



2101 TCGTGATCGA TTCAAGAGGA CATCATTCTT TCTTTGGGTA ATTATCCTTT
AGCACTAGCT AAGTTCTCCT GTAGTAAGAA AGAAACCCAT TAATAGGAAA 



2151 TCCAAAGAAC ATT^TCCATC CCACTTCGAG TCATCCACAA TACCACATTA 
AGGTTTCTTG TAAAAGGTAG GGTCAACCTC ACTAGGTGTT ATCGTGTAAT 



2201 CACCTTAGTG ATGTCCACAA ACTAOTTTGT CGTGACAAAC TGTCATCCAC
CTCCAATCAC TACAGCTGTT TGATCAAACA GCACTGTTTG ACAGrrAGGTG 



2251 AAATCAATTG AGATCAGTTG GACTGAATCT CGAACGGAAT GGAGTGGCAA 
TTTAGTTAAC TCTAGTCAAC CTGACTTAGA GCTTCCCTTA CCTCACCGTT 



2301 C7GACGTGCC ATCTGCAACT AAAACATGGG GCTTCAGGTC CGGTGTCCCA 
6ACTGCACGG TAGACGTTGA TTTTCTACCC CGAAGTCCAG GCCACAGGGT 



2351 CCAAAGGTCG TCAATTATGA AGCTGGTGAA TGGGCTGAAA ACTGCTACAA 
GGTTTCCACC AGTTAATACT TCGACCACTT ACCCGACTTT TOACGATGTT 

2401 TCTTGAAATC AAAAAACCTG ACGCGAGTGA GTCTCTACCA CCAGCGCCAfl 
AOAACTTTAG TTMTTGGAC TGCCCTCACT CACAGATGGT CGTCGCGGTC 

2451 ACGGGATTCG GGOCTTCCCC CQGTGCCCGT ATGTCCACAA AGTATCAGGA
TGCCCTAACC CCCGAACGGG GCCACGGCCA TACACGTGTT TCATAGTCCT 



2501 ACGGGACCCT GTGCCCGAGA CTTTGCCTTC CATAAASAGG GrGCTTTCTT 
TGCCCTGGCA CACGGCCTCT GAAACGGAAG GTATTTCTCC CACGAAAGAA 
2551 CCTGTATCAT C£3ACTTCCTT CCACAQTTAT CTACCGAGGA ACGACTTTCC
OGACATACTA GCTSAACOAA CCTOTCAATA GATGGCTCCT TGCTGAAAGC 

2601 CTGAAC5GTCT CGTTGCATTT CTGATACTGC CCCAAQCTAA GAAGGACTTC
GACrrCCACA GCAACGTAAA GACTATGACG GGGTTCGATT CTTCCTCfcAG 

2651 TTCAGCTCAC ACCCCTTGAG AGACCCGGTC AATGCAACGG AGGACCCCTC 
AAGTCCAGTG TGGCGAACTC TCTCGGCCAO TTACCTTGCC TCCTOGGCAG 

EcoRV

2701 TACTGGCTAC TATTCTACCA CAATTAGATA TCAGGCTACC GOTTTTGCAA 
ATCACCGATG ATAACATGGT GTTAATCTAT AGTCCGATGC CCAAAACCTT 

2751 CCAATGAGAC AGAGTACTTG TTCGAGGTTG ACAATTTGAC CTACGTCCAA
GGrTACTCTG TCTCATGAAC AAGCTCCAAC TGTTAAACTG GATGCAGGTT 

2801 CTTGAATCAA GATTCACACC ACAGTTTCTG CTCCACCTGA ATGAGACAAT 
GAACTTAGTT CTAAGTGTGC TGTCAAAGAC GAGGTCGACT TACTCrTGTTA 

2851 ATATACAAGT CGGAAAAGGA GCAATACCAC GGCAAAACTA ATTTGGAAGC 
TATATGTTCA CCCTTTrCCT CGTTATGGTG CCCTTT7GAT TAAACCrTCC 

2901 TCAACCCCGA AATTGATACA ACAATCGGGG AGTGGGCCTT CTGGGAAACT 
AGTTGGGGCT TTAACTATGT TGTTAGCCCC TCACCCGOAA GACCCTTTGA 

2951 ARAAAAAACC TCACTAGAAA AATTCGCAG^ CAAGAGTTCT CTTTCACAGT

•rrrrrrrTGG agtgatcttt ttaagcgtca cttctcaaca gaaagtgtca 

3001 TGTA3CAAAC GGAGCCAAAA ACATCAGTCG TCAGRGTCCG GCGCGAACTT 
ACATACTTTG CCTCGGTTTT TGTAGTCACC AGTCTCAGGC CGCGCTTGAA 

3051 CTTCCCACCC AGGGACCAAC ACAACAACTG AAGACCACAA AATCATGGCT 
GAAGGCTGGG ICCCTGGTTG TCTTGTTGAC TTCTGGTGTT TTAGTACCGA 

3101 TCAGAAAATT CCTCTCCAA7 GGTTCAAGTG CACAGTCAAG GAACGGA^ 
AGTCTTrCAA GGACACGTTA CCAAGTTCAC GTG-rCAGTTC CTTCCCTTCG 

3151 TGCAGTGTCG CATCTAACAA CCCTTGCCAC AATCTCCihCG AGTCCCCAAT 
ACGTCACAGC GTAGATTGTT GGGAACGGTG TTACAGGTGC TCAGGGGTTA 

3201 CCCTCy^CAAC CAAACCAGGT CCGGACAACA GCACCCATAA TACACCCGTG 
GGGAGTGTTG GTTTGGTCCA CGCCTGTTGT CGTGGCTATT ATCTGGGCAC 

3251 TATAARCTTG ACATCTCTGA GGCAACTCAA GTTGAACAAC ATCACCGCAG 
ATATTTCAAC TGTAGAGACT CCGTTGAiGTT CAACTTGTTG TAGTGGCCTC 

3301 AACAGACAAC (5ACAGCACAG CCTCCGACAC TCCCTCTGCC ACGACCGCAG 
TTGTCTGTTG CTCTCGTGTC GGAGGCTCTG AGGGAGACGG TCCTGGCGTC 

3351 CCGGACCCCC AAAAGCAGAG AACACCAACA CGAGCAAGAG CACTGACTTC 
GGCCTOOGGG TTTTCGTCTC TTCTCGTTGT GCTCGTTCTC GTGACTCAAG 



3401 CTGGACCCCG CCACCACAAC AAGTCCCCAA AACCACAGCC AGACCGCTGG 
GACCTGGCGC GGTGCTGTTG TTCAGGGGTT TTGGTGTCGC TCTGGCOACC 
4401 AATG3UK3AXA TTGCATCCCA TTGTCTGAQT ASCTGTCATT C^ATTCTGGG
TTACTCCTTT AACCTAGCGT AACAGACTCA TCCACAGTAA GATAACACCC 



4451 COGTGGGCTO GGGCXOCACA GCAXOaOOOA COATTOGCAA GACAATACCA. 

CCCACCCCAC CCCOTCGTGT CGTTCCCCCT CCTAACCCTT CTGTTATCGT 



4501 GOCATGCCGG GGATCCGGTG GCCTCTATGG GTACCCACCT GCTGAACRAT 
CCGTACGACC CCTACGCCAC CCGAGATACC CATGCGTCCA CGACTTCTTA 



4551 TQACCCCGTT CCTCCTGGCC CAGAAACAAO CAGGCACATC CCCTTCTCTG 
ACTGGGCCAA GGAGOACCCG GTCTTTCTTC GTCCGTGTAG GCCAAGA6AC 



4601 TGACACACCC TCTCCACGCC CCTGGTTCTT AGTTCCAGCC CCACTCATAG 
ACTGTGTGGG ACAQGTGCGG GOACCAAGAA TCAAGGTCGG GGTGAGTATC 



4651 GACACTCATA GCTCAGCACG GCTCCGCCTT CAATCCCACC CGCTAAAGTA 
CTGTGAGTAT CGAGTCCTCC CGACOCGQAA CTTAGGGTGG CCGATTTCAT 



4701 CTTGGAGCCG TCTCTCCCTC CCTCATCAGC CCACCAAACC AAACCTAGCC 
GAACCTCOCC AGAGAGGGA6 CGAGTAGTCC GGTGGTTTGG TTTGGATCGG 



4751 TCCAAGAGTG GGAAGAAATT AAAGCAACAT AGGCTATTAA GTGCACAGGG 
AGGTTCTCAC CCTTCTTTAA TTTCGTTCTA TCCGATAATT CACGTCTCCC 



4801 AGAGAAAATG CCr?CCAACAT CTGAGGAAGT AATGAGAGAA ATCATAGAAT 
TCTCTTTTAC GGAGGTTCTA CAC7CCTTCA TTACTCTCTT TAGTATCTXA 



4851 TTCTTCCGCT TCCTCGCTCA CTGACTCGC7 GCGCTCGGTC GTTCGGCTGC 
AAGAAGGCGA AGGAGCCAGT GACTGAGCGA CCCGAGCCAC CAAGCCGACG 



4901 GGCCW3CGGT ATCAGCTCAC TCAAAGGCGG TAATACGGTT ATCCACAGAA 
CCGCTCCCCA TAGTCGAGTG AGTTTCCGCC ATTATGCCAA TAGGTGTCTT 



4951 TCAGGGGATA ACGCAGGAAA CJUICATGTCA GGAAAACCCC AGC AAAAG GC
AGTCCCCTAT TGCGTCCTTT CTrTGTACACr CGTTTTCCGG TCGTTTTCCQ 



5001 CAGGAACCGT AJUJIAGGCCG CGTTGCTGGC GTTTrrCCAT AGGCTCCGCC
GTCCTTGGCA TTTTTCCCGC GCAACGACCG CAAAAAGGTA TCCGACGCGG 



5051 CCCCTGACGA GCATCACAAA AATCGACGCT CAAGTCACAG GTGGCCAAAC 
GGGGACTGCT CGTAGTGTTT TTAGCTGCGA GTTCACTCTC CACCGCTTTG 



5101 CCGACAGGAC TATAAAGATA CCAGGCGTTT CCCCCTGGAA GCTCGCTCGT
GCCTGTCCTG ATATTTCTAT G6TCCGC2UA GGCGGACCTT CGAGGGACCA 



5151 GCGCTCTCCT GTTCCGACCC TCCCGCTTAC CGGATACCTG TCCGCCTTTC 
CGCGAGAGGA CAAGGCTGCG ACGGCGAATG GCCTATGGAC AGCCGGAAAO 



5201 TCCCrrCGGG AAGCGTGGCC CTTTCTCAAT GCTCACGCTG TAGCTATCTC 
ACGGAAGCCC TTCGCACCGC GAAAGAGTTA CGAOTGCGAC ATCCATACAG 



5251 AGTTCGCTGT AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC ACGAACCCCC 
TCAAGCCACA TCCAGCAACC GAGGTTCGAC CCGACACACG TGCTTGGGGG 
5301 CGTTCACCCC GACCQCTGCG CCTTATCCCO TAACTATCGT CTTCAGTCCA
GCAAGTCGGC CTOGCGACGC GGAATAOGCC ATTOATAGCA GAACTCACCT 



5351 ACCCGGTAAC ACACGACTTA TCGCCACTGC CAOCAGCCAC TGOTAACAGG 
TCGGCCATTC TGTGCTGAAT AGCGGTGACC GTCGTCGCTG ACCATTGTCC 



5401 ATTAGCAGAO CGAGCTATGT AGGCGGTGCT ACAGAGTTCT TQAAGTCGTG 
TAATCGTCTC GCTCCATACA TCCGCCACCA TGTCTCAAGA ACTTCACCAC 



5451 CCCTAACTAC GGCTACACTA CAAGGACAGT ATTIGGTATC TGCGCTCTGC 
CGGATTGATG CCCATGTGAT CTTCCTGTCA TAAACCATAG ACCCGAGACG 



5501 TGAAGCCAGT TACCTTCCCA AAAAGAGTTG GTACCTCTTO ATCCGGCAAA 
ACTTCGGTCA ATGGAAGCCT TTTTCTCAAC CATCGAfiAAC TAGGCCGTTT 



5551 CAAACCACCG CTGGTACCGG TGGTTTTTTT GTTTGCAAGC AGCAGATTAC 
CTTTGGTGCC GACCATCGCC ACCAAAAAAA CAAACGTTCG TCGTCTAATG 



5601 GCGCAGAAAA AAAGGATCTC AAGAASATCC TTTGATCTTT TCTACGGGGT 
CGCCTCTTTT TTTCCTACAa TTCTTCTAGG AAACTAGAAA AGATGCCCCA 



5651 CTGACGCTCA GTGGAACGAA AACTCACGTT AAGGGATTTT GGTCATGACA 
GACTGCGAGT CACCTTGCTT TTGAGTGCAA TTCCCTAAAA CCAGTACTCT 



5701 TTATCAAAAA GGATCTTCAC CTASATCCTT TTAAATTAAA AATGAAGTTT 
AATAGTTTTT CCTAGAAGTG GATCTACGAA AATTTAATTT TTACTTCAAA 



5751 TAAATCAATC TAAAGTATAT ATGAGTAAAC TTGGTCTOAC AGTTACCAAT 
ATTTAGrrAG ATITCATATA TACTCATTTG AACCAGACTG TCAATGSTTA 



5801 GCTTAATCAC TGAGGCACCT ATCTCAGCGA TCTGTCTATT TCCTTCATCC 
CGAATTAGTC ACTCCGTGGA TAGAGTCGCT ACACAGATAA AGCAAGTACG 



5851 A-AGTTGCCT CACTCCGCCG GGGGCGGGCG CTGAGGTCTC CCTCGTGAAG 
TA'iCAACCCA CTGAGGCCCC CCCCCCCCCC GACTCCAGAC GGAGCACTTC 



5901 AAGGTGTTGC TGACTCATAC CAGGCCTGAA TCCCCCCAXC ATCCAGCCAG 
nCCACAACG ACTGAGTATG GTCCGGACTT AGCGGGCTAQ TAGGTCGGTC 



5951 AAAGTGAGGG AGCCACCGTT GATGAGAGCT TTGTTCTAGG TGCACCAGTT 
TT5CACTCCC TCGGTCCCAA CTACTCTCCA AACAACATCC ACCTGGTCAA 



6001 GGTGATTTTG AACTTTTGCT TTGCCACGCA ACGGTC7GCG TTCTCGGGAA 
CCACTAAAAC TTGAAAACGA AACGGTGCCT TCCCAGACCC AACAGCCCTT 



6051 GATGCGTGAT CTGATCCTTC AACTCAGCAA AAGTTCGATT TATTCAACAA 
CTACGCACTA GACTAGGAAG TTGAGTCCTT TTCAAGCTAA ATAAGTTGTT 



6101 aoccgccgtc ccctcaagtc agcgtaatgc tctgccagtg ttacaaccaa 
tcggcggcag cgcagttcag tcgcattacg ACACGGTCAC aatgttgctt 



6151 TTAACCAATT CTGATTAGAA AAACTCATCG AGCATCAAAT GAAACTGCAA 
AATTGGTTAA. GACTAATCTT TTTGAGTAGC TCGTAGTTTA CTTTGACGTT 



6201 TTTATTCATA TCAGGATTAT CAATACCATA 1TTTTGAAAA AGCCGTTTCT 
AAATAAGTAT ACTCCTAATA GTTATGGTAT AAAAACTTTT TCCGCAAAGA 
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6251 CTAATGAAGG AGAXAACTCA CCQAGCCACT TCCATACGAT GCCAAGATCC 
CATTACTTCC TCTTTTGAGT GGCTCCOTCA AGGTATCCTA CCGTTCTAGG 



6301 TGGTA-rCGGT CTCCOATTCC GACTCGTCCA ACATCAATAC AACCTATTAA 
ACCATAGCCA GACGCTAAGG CTGACCAGGX TGTAGTTATG TTGGATAATT 


6351 TTTCCCCTCG TCAAAAATAA GGTTATCAA6 TCAGAAATCA CCAT6AGTGA 
AAAGGGGAGC AGTTTTTATT CCAATAGTTC ACTCTTTAGT GGTACTCACT 




HindIII 

6401 CGACTGAATC CGGTGACAAT GGCAAAAGCT TATGCATTTC TTTCCAGACT 
GCTGACTTAC GCCACTCTTA CCGTTTTCGA ATACGT&WIG AAAGGTCTGA 


6451 TGTTCAACAG GCCAGCCATT ACGCTCGTCA TCAAAATCAC TCQCATCAAC 
ACAAGTTGTC CGGTCGGTAA TGCGAGCAGT AGTTTTAGTG AGCGTAGTTG 






PvuI 

6501 CAAACCGr^A TTCATTCGTG ATTGCGCCTG AGCGAGACGA AATACGCGAT 
GTTTGGCAAT AAQTAAGCAC TAACGCGGAC TCGCTCTGCT TTATGCGCTA 



PvuI 

6551 CGCTGTTAAA AGGACAATTA CAAACAGGAA TCGAATGCAA CCGGCGCAGG 
CCGACAATTT TCCTGTTAAT GTTTGTCCTT AGCTTACGTT GCCCGCGTCC 


6601 AACACTGCCA GCGCATCAAC AATATTTTCA CCTGAATCAG CATATTCTTC 
TTGTCACGGT CGCGTAGTTG TTATAAAAGT CGACTTAGTC CTATAAGAAG 


6651 TAATACCTGG AATGCTGTTT TCCCCGGGAT CGCAGTGGTG AGTAACCATG 
ATTATGGACC TTACGACAAA AGCGCCCCTA GCGTCACCAC TCATTGGTAC 


6701 CATCATCAGG AGTACGGATA AAATGCTTGA TGGTCGGAAG AGGCATAAAT 
GTAGTAGTCC TCATGCCTAT TTTACGAACT ACCAGCCTTC TCCCTATTTA 


6751 TCCGTCAGCC AGTTTAGTCT GACCATCTCA 7CTGTAACAT CATTGGCAAC 
AGGCAGTCGG TCAAATCAGA CTGGTACAGT AGACATTGTA GTAACCCTTG 


6801 GCTACCTTTG CCATGTTTCA GAAACAAC^IC TGGCGCATCG GGCTTCCCAT 
CGATGGAAAC GGTACAAAGT CTTTGTTGAG ACCGCGTAGC CCGAAGCGTA 


6851 ACAATCGATA GATTGTCGCA CCTGATTCCC CGACATTATC CCGAGCCCAT 
TGTTAGCTAT CTAACAGCGT GGACTAACG6 CCTGTAATAG CGCTCCGGTA 



XhoI 

6901 TTATACCCAT ATAAATCACC ATCCATCTTG GAATTTAATC GCGGCCTCGA 
AATATGGGTA TATTTAGTCG TACOrTACAAC CTTAAATTAG CGCCCGAGCT 



6951 GCAACACGrr TCCCGTTGAA TATGGCTCAT AACACCCCTT GTATTACTGT 
CGTTCTGCAA AGGGCAACTT ATACCGAGTA TTGTGGGGAA CATAATGACA 



7001 TTATGTAAGC AGACAGTTTC ATTGTTCATC ATGATATATT TTTATCTTGT 
AATACATTCG TCTGTCAAAA TAACAAGTAC TACTATATAA AAATAGAACA 






DraIII 

7051 GCAATGTAAC ATCyiQAQATT TCGAGACACA ACGTGGCTTT CCCCCCCCCC 
CGTTACATTG TAGTCTCTAA AACTCTGTGT TQCACCGAAA GCGGGGGGGG 


7101 CCATTATTGA AGCATTTATC AGCCTTATTG TCTCATGAGC GOATACATAT 
GGTAATAACT TCGTAAATAC TCCCAATAAC AGAGTACTCC CCTATGTATA 


7151 TTGAATGTAT TTAGAAAAAT AAACAAATAC GGGTTCCGCC CACATTTCCC 
AACTTACATA AATCTTTTTA TTTCTTTATC CCCAAGGCGC GTGTAAAGGG 


7201 CGAAAAOTGC CACCTGACGT CTAAGAAACC ATTATTATCA TGACATTAAC 
GCTTTTCACG GTOGACTGCA GATTCTTTGG TAATAATAGT ACTGTAATTC 


7251 CTATAAAAAT AGGCCTATCA CGACGCCCTT TCOTC 
GATATTTTTA TCCGCATAGT CCTCCGGGAA AGCAG 
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1012-30? ^ I . J- xrt) MO'. 4 

General Description 



DNA pVR 1013-SCP(Z) 
local object 
Created: 09/14/98 04:29 PM 
Last Modified: 09/15/98 04:50 PM 
Length: 7272 bp 
Storage type: Basic 
Form: Circular 



Restriction Map 

DraIII: 1 site 
HindIII: 1 site 
HpaI: 1 site 
KpnI: 1 site 
NotI: 1 site 
PvuI: 1 site 
SacII: 1 site 
XbaI: 1 site 
XhoI: 1 site 

EcoRV: 2 sites 
NcoI: 2 sites 
NdeI: 2 sites 
SphI: 2 sites 
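
The site counts in a restriction map of this kind can be cross-checked against the sequence itself. The following is a minimal sketch, not the software that produced the map above. The function name `count_sites` and the toy sequence are illustrative assumptions, and the recognition sequences are the standard ones for each enzyme rather than values taken from this listing. Because the construct is circular, the scan wraps around the origin.

```python
import re

# Standard recognition sequences (N = any base). All of these are palindromic,
# so scanning a single strand finds every site on both strands.
RECOGNITION = {
    "DraIII": "CACNNNGTG",
    "HindIII": "AAGCTT",
    "XbaI": "TCTAGA",
    "XhoI": "CTCGAG",
    "EcoRV": "GATATC",
    "NcoI": "CCATGG",
}


def count_sites(seq: str, site: str, circular: bool = True) -> int:
    """Count occurrences of a recognition site in a DNA sequence.

    For a circular molecule the first len(site) - 1 bases are appended to the
    end so that sites spanning the origin are counted exactly once.
    """
    seq = seq.upper()
    pattern = site.upper().replace("N", "[ACGT]")
    text = seq + seq[: len(site) - 1] if circular else seq
    # A zero-width lookahead lets overlapping matches be counted.
    return len(re.findall(f"(?={pattern})", text))


# Hypothetical toy sequence: the HindIII site AAGCTT spans the origin.
toy = "GCTTCCCCAA"
print(count_sites(toy, RECOGNITION["HindIII"]))                  # circular scan
print(count_sites(toy, RECOGNITION["HindIII"], circular=False))  # linear scan
```

A scan like this is only as reliable as the sequence text it is given, so it would need a clean copy of the 7272 bp sequence rather than scanned text.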

Functional Map 
CDS (4 signals) 
CMV IE 5' UT 

Start: 886 End: 1129 
CMV IE INT 

Start: 1130 End: 1540 

TbGH 

Start: 4289 End: 4841 

Kan r 

Start: 6337 End: 6953 (Complementary) 
Misc_feature (2 signals) 
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CMV enhancer 

Start: 248 End: 885 
SGP(Z) 

Start: 1870 End: 4288 
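
As a quick consistency check on the functional map, feature lengths and overlaps can be computed from the start and end coordinates. The sketch below is illustrative only: the helper names are assumptions, and it uses just the four features whose coordinates are clearly legible in this listing, with the 7272 bp length from the general description. Coordinates are treated as 1-based and inclusive.

```python
from itertools import combinations

PLASMID_LENGTH = 7272  # bp, from the general description

# (start, end), 1-based inclusive, as legible in the functional map
FEATURES = {
    "CMV enhancer": (248, 885),
    "CMV IE 5' UT": (886, 1129),
    "SGP(Z)": (1870, 4288),
    "TbGH": (4289, 4841),
}


def feature_length(start: int, end: int) -> int:
    """Length in base pairs of an inclusive interval."""
    return end - start + 1


def overlaps(a: tuple, b: tuple) -> bool:
    """True if two inclusive intervals share at least one position."""
    return a[0] <= b[1] and b[0] <= a[1]


for name, (start, end) in FEATURES.items():
    assert 1 <= start <= end <= PLASMID_LENGTH
    print(f"{name}: {feature_length(start, end)} bp")

for (n1, f1), (n2, f2) in combinations(FEATURES.items(), 2):
    if overlaps(f1, f2):
        print(f"overlap: {n1} / {n2}")
```

With these coordinates the SGP(Z) insert spans 2419 bp and abuts the TbGH feature without overlapping it.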



Annotations 
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TCCCGCGTTrr CGtSTCATGAC GGTGWUACC TCTQACACAT GCAGCTCCCO 
ACCGCGCAAA GCCACTACTG CCACTTTTCC AQACTCTGTA CCTCOAOCCC 

GAGACGG7CA CA.GCTTGTCT QTAAGCGGAT CCCGGC».CCA GACAAQCCCG 
CTCTGCCAGT GTCGAACAGA CATTCGCCTA CGGCCCTCGT CTGTTCGCCC 



101 TCAGGGCGCG TCAGCGGGTG TTGGCGtSGTG TCGGCGCTGG CTTAACTATG 
ACTCCCGCGC ag«:gcccac AACCGCCCAC AGCCCCGACC GAATTGATAC 



151 CGGCATCAGA GCACATTGTA CTGAGAGTGC ACCATATGCG GTQTGAAATA 
CCCGTACTCT CGTCTAACAT GACTCTCACG TCCTATACGC CMafiTTTAT 



201 ccqcacagat gcgtaaggag aaaataccgc atcaoattog ctattgccca 
ggcctgtcta cgcattcctc ttttatggcg tagtctaacc gataaccggt 



251 TTGCATACGT TGTATCCATA TCATAATATG TACATTTATA TTGGCTCATG 
AACGTATGCA ACATAGGTAT AGTATTATAC ATGTABATAT AACCGAGTAC 



301 TCCAACATTA CCGCCATGTT GACATTGATT ATTGACTAGT TATTAATAGT 
ACCTTGTAAT GGCGCTACAA CTGTAACTAA TAACTGATCA ATAATTATCA 



351 AATCAATTAC GGGGTCATTA GTTCATAGCC CATATATGGA GTTCCGCCTT 
7TACTTAATC CCCCACTAAT CAAGTATCGG GTATATACCT CAAGGCGCAA 



401 ACATAACTTA CGGTAAATGG CCCGCCTQGC TCACCGCCCA ACGACCCCCG 
TGTATTGAAT GCCATTTACC GGGCGGACCG ACTGGCGGGT TGCTGGGGGC 



451 CCCATTGACG TCAATAATGA CGTATGTTCC CATAGTAACG CCAATAGGGA 
GGGTAACTGC AGTTATTACT CCATACAAGG QTATCATTGC GCTTATCCC? 



501 CTCTCCATTC ACGTCAATGG GTGGAGTRTT TACGGTAAAC TGCCCACTTG 
GAAAGGfAAC TGCAGTTACC CACCTCATAA ATGCCATTCG ACGGGTGAAC 



551 GCAGTACATC AAGTCTATCA TATGCCAAGT ACGCCCCCTA TTGACGTCAA 
CGTCATGTAG TTCACATAGT ATACGCTTCA TCCGGGGGAT AACTGCAGTT 



601 TCACGGTAAA TGGCCCGCCT GGCATTATGC CCAGTACATC ACCTTATGGG 
ACTGCCATTT ACCGGGCGGA CCGTAATACG GGTCATGTAC TGGAATACCC 



651 ACTTTCCTAC TTGGCACTAC ATCTACGTAT TAGTCATCGC TATTACCATG 
TCAAAGCATG AACCGTCATC TAGATCCATA ATCAGTAGCG ATAATGCTAC 



701 OTGATGCGGT TTTGGCAGTA CATCAATGGG CGTGGATAGC GGTTTGACTC 
CACTACGCCX AAACCGTCAT GTAGTTACCC GCACCTATCG CCAAACTGAG 

751 ACGGGGATTT CCAAG^CTCC ACCCCATTGR- CGTCAATCGG AGTrTGTTTT 
TGCCCCTAAA GGTTCAGAGC TGGGGTAACT GCAGTTACCC TCAAACAAAA 



801 GGCACCAAAA TCAACGGGAC TTTCCAAAAT GTCGTAACAA CTCCGCCCCA 
CCCTGGTTTT AGTTGCCCTG AAAGGT^TTA CAGCATTCTT GAGGCGGGGT 
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1151 TATAGCTTAG CCTATACGTG TGGGTTATTC ACCATTATTG ACCACTCCCC 
ATATCGAATC GGA.TATCCAC ACCCAATAAC TGGTAATAAC TGGTGAGCGG 



1601 TCGCGGTAGG GTATOTGTCT GAAAATGAGC GTGGASATTC GGCTCGCACG 
ACCGCCATCC CATACACAGA CTTTTACTCG CACCTCTAAC CCGAGCGTGC 



1701 CTGAGTTGTT GTATTCTGAT AAGAGTCAGA GGTAACTCCC CTTGCGGTCC 
GACCCAACAA CATAAGACTA TTCTCAGTCT CCAWGAGGC CAACOCCACG 
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Hpal 

1751 TOTTAACGCT GGAGOGCAGT aTAGTCTGAG CAGTACTCGT TGCTGCCCCG 
ACAATTGCCA CCTCCCGTCA CATCAQACTC GTCATGAGCA XCGACGQCGC 



NcoI 

1801 CGCCCCACCA GACATAA.TAG CTGACAGACC AACAGACTGT TCCTTTCCAT 
CCGCGGTGGT CCCTATTATC GACTGTCTGA TTCTCTGACA AGGAAAGGTA 



NcoI Pall EcoRV NotI 



1851 GGCTCTTrrC TGCAGTCACC CTCGTCGACA CGTCTGATCA CA.TATCGCCC 
CCCAGAAAAQ ACCTCAGTGG CAGCACCTGT GCACACTAGT CTATAGCGCC 





NotI XbaI 








1901 CCGCTCTAGA CCAGGCGCCT GCATCGAATT GATGAAGATT AACCCGACAG 
GGCSAGATCT GGTCCGCGGA CCTAGCCTAA CTACTTCTAA TTCGGCTGTC 


1951 TGAGCGTAAT CTTCATCTCT CTTAGATTAT TTGTTTTCCA GAGTAGGGGT 
ACTCGCATTA GAACTAGAGA GAATCTAATA AACAAAAGGT CTCATCCCCA 


2001 CGTCAGGTCC TTTCCAATCG TGTAACCAAA ATAAACTCCA CTAGAAGGAT 
CCAGTCCAGG AAAAGTTAGC ACATTGGTTT TATTTGAGGT GATCTTCCTA 


2051 AtTTGTGGGGC AACAACACAA TGGGCGTTAC AGGAATATTO CAGTTACCTC 
TAACACCCCG TTGTTGTGTT ACCCGCAATG TCCTTATAAC GTCAATGCAG 


2101 GTGATCGATT CAAGAGGACA TCATTClTrc TTTGGGTAAT TATCCTTTTC 
CACTAGCTAA GTTCTCCTCT AGTAAGAAAG AAACCCATTA ATAGGAAAAG 


2151 CAAAGAACAT TTTCCATCCC ACTTGGAGTC ATCCACAATA GCACATTACA 
GTTTCTTGTA AAAGGTAGGG TGAACCTCAG TAGGTCTTAT CCTGTAATGT 


2201 GGTTAGTGA7 GTCGACAAAC TAGTTTGTC6 7GACAAACTG TCATCCACAA 
CCAATCACTA CAGCTCTTTG ATCAAACACC ACTGTTTGAC AGTAGGTCTT 


2251 ATCAATTGAC ATCAGTTGGA CTGAATCTCG AAGGGAATGG AGTGGCAACT 
TAGTTAACTC TAGTCAACCT GACTTAGAGC TTCCCTTACC TCACCCCTGA 


2301 GACGTGCCAT CTGCAACTAA AAGATGGGGC TTCAGGTCCG GTGTCCCACC 
CTGCACGGTA GACGTTGATT TTCTACCCCG AAGTCCAGGC CACAGGGTCG 


2351 AAAGGTGGTC AATXATGAAG CTGGTGAATC GGCTCAAAAC TCCTACAATC 
TTTCCACCAG TTAATACTTC GACCACTTAC CCCACTTTTC ACGATCTTAO 


2401 TTCAAATCAA AAAACCTGAC GGGAGTGAGT GTCTACCAGC AGCGCCXGAC 
AACTTXAGTT TTTTGGACTC CCCTCACTCA CA0ATGGTC6 TCCCGGTCTG 


2451 CCGAT7CCGG GCTTCCCCCC GTGCCGGTAT GTGCACAAAG TATCAGGAAC 
CCCTAAGCCC COAAGGGGGC CACCGCCATA CACGTGTrTC ATAGTCCTTG 


2501 GCCACCGTGT GCCGGAGACT TTCCCTTCCA TAAAGAGGGT GCTTTCTTCC 
CCCTGSCACA CGCCCTCTOA AACGGAAGGT ATTTCTCCCA CCAAAGAACG 


2551 TGTATGATCG ACTTGCTTCC ACAGTTATCT ACCCAGGAAC CACTTTCGCT 
ACA7ACTAGC TGAACGAAGG TGTCAATAGA TGGCTCCTrG CTGAAAGCGA 
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2601 CAA.GGTCTCG TTGCATTTCT GATACTGCCC CMGCTAAOA. AUGACTTCTT 
CTTCCACAGC AACGTAAAGA CTATQACGCG GTTCGATTCT TCCTGRAGAA 



2651 CAGCXCACAC CCCTTGAGAO AGCCQGTCA*. TGCAACGGAG GACCCCTCTA 
GTCCAGTGTG GGGAACTCTC TCGGCCAGTT ACGTTGCCTC CTCGGCAOAT 



EcoRV 

2701 GTGGCTACTA TTCTACCACA ATTAGATATC AGGCTACCGG TTTTGGAACC 
CAtXGATGAT AAQATGGTGT TAATCTATAG TCCCATQGCC AAAACCTTOO 



2751 AATQACACAG AGTACTTCTT CGAGGTTGAC AATTTCACCT ACCTCCAACT 
TTACTCTGTC TCATCARCAA GCTCCAACTG TTAAACTGOA TGCACCTTGA 



2801 TGAATCAAGA TTCACACCAC AGTTTCTGCT CCAGCTQAAT GAGACAATAT 
ACrTAGTTCT AACTGTGCTG TCAAAGACGA GGTCGACTTA CTCTQTTATA 



2851 ATACAACTGG GAAAAGGAGC AATACCACGG GAAAACTAAT TTGGAAGGTC 
TATGTTCACC CTTTTCCTCG TTATGGTGCC CTTTTGATTA AACCTTCCAG 



2901 AACCCCCAAA TTGATACAAC AATCCGG5AG TGGGCCTTCT GGCAAACTAA 
TTGGCGCTTT AACTATCTTG TTAGCCCCTC ACCCGGAAGA CCCTT7GATT 



2951 AAAAACCTCA CTAGAAAAAT TCGCAGTGAA GAGTrGTCTT TCACAGTTGT 
•mrTGGAGT GATCTTTTTA AGCGTCACTT CTCyiACAGAA AGTGTCAACA 



3001 ATCAAACGGA GCCAAAAACA TCAGTGCTCA GAGTCCGGCC CGAACTTCTT 
TAGTTTGCCT CGCrTTTTCT AGTCACCAGT CTCAGGCCGC GCTTGAAGAA 



3051 CCGACCCAGG GACCAACACA ACAACTGAAG ACCACAAAAT CATGCCTTCA 
CGCTGGGTCC CTGGTTGTGT TGTTOACTTC TGGTGTTTTA GTACCGAACT 



3101 GAAAA-PTCCT CTGCAATGGT TCAAGTGCAC ACTCAAGGAA GGCAAGCTGC 
CTTTTAAGGA GACGTTACCA AGTXCACGTG TCAGTTCCrT CCCTTCGACG 



3151 AGTGTCGCAT CTAACAACCC TTGCCACAAT CTCCACGAGT CCCCAATCCC 
TCACAGCGTA GATrGTTGGG AACGGTCTTA GACGTGCTCA GGGGTTAGGG 



3201 TCACAACCAA ACCAGGTCCG GACAACAGCA CCCATAATAC ACCCGTGTAT 
AGTGTTGGTT TGCTCCAGGC CTGTTG'PCGT GCCTATTATG TGGGCACATA 



3251 AAACTTGACA TCTCTGAGGC AACTCAAGTT GAACAACATC ACCGCAGAAC 
TTTGAACTGT AGAGACTCCG TTGACTTCAA COTGTTCTAG TCCCGTCTTO 



3301 ACACAACGAC AGCACAGCCT CCGACACTCC CTCTGCCACG ACCCCAGCCG 
TCTGTTGCTG TCGTGTCCGA GOCTGTGACG GAGACCGTGC TGGCGTCGGC 



3351 GACCCCCAA& AGCAGAGAAC JUCCAACACGA CCAAGACCAC TGACTTCCTC 
CTGCGGGTTT TCGTCTCTTG TGGTTGTGCT CGTTCTCGTC ACTGAAGGAC 



3401 GACCCCGCCA CCACAACAAG TCCCCAAAAC CACAGC6ACA CCGCTGGCAA 
CTGGGGCGGT GCTGTTGTTC AGGCGTTTXC GTGTCCCTCT GOCGACCGTT 



3451 CAACAKCACT CATCACCAAG ATACCG6AGA AGAGAGTGCC AGCAGCGCGA 
GTTGTTGTGA GTAGTGGTTC TATGGCCTCT TCTCTCACGG TCGTCCCCCT 



3501 AGCTAGGCTT AATTACCAAT ACTATTCCTG CAGTCGCACG ACTGATCACA 
TCGATCCGAA TTAATGCTTA TGATAACGAC CTCACCCTCC TGACTAGTCT 


3551 GGCCGGAGAA CAACTCGAAC AGA^USCAATT CTCAATCCTC AACCCAAATC 
CCGCCCTCTT CTTGAGCTTC TCTTCGTTAA CAGTTACQAG TTGGGTTTAC 


3601 CAACCCTAAT TTACATTACT GGACTACTCA CGA,TGAACGT GCTCCAATCG 
CTTGGGATTA AATGTAATQA CCTGATGACT CCTACTTCCA CCACGTTAGC 




3651 GACTGCCCTG GATACCATAT TTCGGGCCXG CAGCCGAGGG AATTTACATA 
CTGACCGGAC CTATGGTATA AAGCCCCGTC GTCGGCTCCC TTAAATGTAT 




3701 GAGGGGCTAA TGCACAATCA ASATGGTTTA ATCTGTGGGT TGAGACAGCT 
CTCCCCGATT ACGTGTTACT TCTACCAAAT TAOACACCCA. ACTCTGTCGA 




3751 GCCCAACGAG ACGACTCAAG CTCTTCAACT GTTCCTCAGai GCCACAACTG 
CCGGTXGCX. TGCTGACrrC GAGRAGTTGA CAAGGACTC? CCGTGTTGAC 




AGCTACGCAC 
TCGATGCGTG 


CTTTTCAATC CTCAACCGTA ACGCAATTCA 
GAAAAGTTAG GAGTTOOCAT TCCSTTAACT 


UlAGAACCAC 




3851 CAGCGATGGG GCGGCACATG CCACATTCTG GGACCGGACT GCTGTATCGA 
GTCCCTACCC CGCCGTGTAC CGTGTAAGAC CCTGGCCTGA CGACATAGCT 




3901 ACCACA.TQAT TGGACCAAGA ACATAXCAG^ CAAAATTGAT CAGATTATTC 
TGGTGTACTA ACCTCGTrCT TCWATrGTCT GTTTTAACTA GTCTAATAAQ 




3951 ATQATTTTGr TCATAAAACC CTTCCGGACC AGGGGGACAA TGACAATTGC 
TACTAXAACA ACTATTTrGG GAACGCCTGG TCCCCCTGTT ACTGTTAIkCC 




4001 TG6ACAGGAT GGA6ACAATG GATACCGGCA GGTATTGGAG TTACAGGCCT 
ACCTGTCCTA CCTCTGTTAC CTATCGCCffT CCATAACCTC AATCTCCGCA 


4051 TATAATTGCA GTTATCGCTT TATTCTGTAT ATGCAAATTT GTCTTTTAGT 
ATATTAACGT CAATAGCGAA ATAAGACATA TACGTTTAAA CAGAAAATCA 


4101 TTTTCTTCAG ATTGCTTCAT GGAAAAGCTC AGCCTCAAAT CAA7GAAACC 
AAAAGAAGTC TAACGAAGTA CCTTTTCCAG TCCGAGTTTA GTTACTTTGG 


4151 AGGATTTAAT 7ATATGGATT ACTTGAATCT AAGATTACTT GACAAATGAT 
TCCTAAATTA ATATACCTAA TOAACTTAGA TTCTAATGAA CTGTTTACTA 


4201 AATATAATAC ACTGGACCOT TAAACATAGC CAATGTGATT CTAACTCCTT 
TTATATTATG 7GACCTCGAA ATTTGTXTCG GTTACACTAA GATTCAGGAA 


4251 TAAACTCACA GTTAATCATA AACAAGGTTT CGAATTGATC TGCTGTGCCT 
ATTTCAGTGT CAATTACTXT rTGTTCCAAA CCTTAACTAG ACGACACGGA 


4301 TCTAGTTGCC AGCCATCTGT TCTTTOCCCC TCCCCCGTGC CTTCCTTGAC 
AGATCAACGG TCGCTAGACA ACAAACGCGG AGGGCGCACG GAAGGAACTG 



4351 CCTCGAAGGT GCCACTCCCA CTGTCCTTTC CTAATAAAAT GAQGAJATTQ 
GQACCTTCCA CCCTGAGCGT GACAGGAAAG GATTArTTTA CTCCTTTAAC 



4401 CATCGCATTG TCTGAGTAGG TGTCATTCTA TTCTGGCCGG TGCGGTQGGG 
GTAGCGTAAC AQACTCATCC ACAGTAACUiT AAGACCCCCC ACCCCACCCC 
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4501 TGCGGTCGGC TCXAIGGGTA CCCAGGTGCT GAAGXATTGA CCCGGTTCCT 
ACGCCACCCG ACATACCCAT GGGTCCACGA CTTCTTAACT GGGCCAAGGA 



5301 CCC7GCGCCT TATCCGGTAA CTATCCTCTT GAGTCCAACC CGGTAAGACA 
GCGACGCGGA ATAGGCCATT GATAGCAGAA CTCAGGTTGG GCCATTCTGT 



5351 CCACTTATCG CCACTQGCAG CAGCCACtGG TAACAGGATT AGCAGAGCGA 
GCTGAATAGC GGTOACCCTC GTCCGTOACC ATTGTCCTAA TCGTCTCGCT 


5401 GCTATGTAOO CGGTOCTACA GAGTTCTTGA ACTGOTGGCC TAACTACCGC 
CCATACATCC CCCACGATGT CTCAAGAACT TCACCACCCG ATTCATGCCO 


5451 TACACTAGAA GOACAGTATT TGGTATCTGC CCTCTGCTGA AGCCACTTAC 
ATGTCATCTT CCTGTCATAA ACCATAGACG CGAGACCACT TCCGTCAATC 


5501 CTTCQGAAAA AGACTTGGTA CCTCTTGATC CGCCAAACAA ACCACCCCTQ 
GAAGCCTTTT TCTCAACCAT CGAGAACTAG CCCGrTTGTT TG6TCCCGAC 


5551 GTAGCGGTGa TTTTTTTCTT TGCAACCAGC ACATTACGCG CAGAAAAAAA 
CATCGCCACC AAAAAAACAA ACGTTCGTCG TCTAATGCGC GTCTTTTTTT 


5601 GOATCTCAAG AAflATCCrTT GATCTTTTCT ACCGGGTCTG ACGCTCAGTS 
CCTAGAGTTC TTCTAGGAAA CTACAAAAGA TGCCCCAGAC TGCGAGTCAC 


5651 GAACGAAAAC TCACGTTAAG GGATTrrGGT CATGAGATTA TCAAAAAGGA 
CTTGCTTTTG AOTGCAATTC CCTAAAACCA GTACTCTAAT AGTTTTTCCT 


5701 TCTTCACCTA GATCCJm-A AATTAAAAAT GAACTTTTAA ATCAATCTAA 
AGAAGTGGA7 CTAGGAAAAT TTAATTTTTA CTTCAAAATT TAGTTAGATT 


5751 AOTATATATG AGTAAACTTG GTCTGACAGT TACCAATGCT TAATCAGTGA 
TCATATATAC TCATTTOAAC CAGACTCTCA ATCGTTACCA ATTAGTCACT 


5801 GGCACCTATC TCAGCGATCT GTCTATTTCQ TTCATCCATA GTTGCCTGAC 
CCGTCGATAG AGTCCCTAGA CAGATAAAGC AAGTA6GTAT CAACGGACTG 


5851 TCCCGGGGSG GGGGCCGCTC AGGTCTGCCT CGTGAACAAG GTGTTGCTCA 
AGGCCCCCCC CCCCCGCGAC TCCAGACGGA GCACTTCTTC CACAACGACT 


5901 CTCATACCAG GCCTGAATCG CCCCATCATC CACCCA6AAA GTGAGGGAGC 
GAGTATGGTC CGGACTTAGC GGGGTAGTAC GTCGGTCTTT CACTCCCTCG 


5951 CACCGTTGAT GAGAGCTTTC TTGTAGGTGG ACCAGTTGGT GA7TTTCAAC 
GTGCCAACTA CTCTCGAAAC AACATCCACC TGGTCAACCA CTAAAACTTG 


6001 TTTTGCTTTG CCACGGAACG GTCTGCCrTQ TCGGGAAGAT GCGTGATCTC 
AAAACOAAAC GGTGCCTTGC CAGACCCAAC ACCCCTTCTA CCCACTAGAC 


6051 ATCCT7CAAC TCAGCAAAAG TTCCATTTAT TCAACAAAGC CGCCGTCCCG 
7AGGAAGTTG AQTCOTTTTC AAGCTAAATA AGTTGTTTCG GCCCCAGGGC 


6101 TCAAGTCAGt GTAATCCTCT CCCAGTCTTA CAACCAATTA ACCAATTCTG 
ACTTCAGTCG CATTACGACA CGGTCACAAT GTTGGTTAAT TGOTTAACAC 


6151 ATTACAAAAA CTCATCGAGC ATCAAATGAA ACTGCAA.TTT ATTCATATCA 
"AATCTTTTT GAGTAGCTCG TAGTTTACTT TGACCTTAAA TAAGTATAGT 



6201 GGATOATCAA TACCATRTTT TTGAAAAAGC CGTTrCTGTA ATCAAGCAQA 
CCTAATAGTT ATGGTATAAA AACTTTTTCG GCAAAGACAT TACTTCCTCT 



6251 AAACTCACCG AGGCAQTTCC ATAGGATGGC AJW3ATCCTCG TATCGGTCTG 
TTTGAGMGC TCCGTCAAGG TATCCTACCG TTCTAGGACC ATAGCCAGAC 
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6701 ACGGATAAAA TGCTTCA.TGG TCGGAAGAGG CStMJsXrcCC GTCASCCAGT 
TCCCTATTTT ACGAACTACC AGCCTTCTCC GTAtTTAAGG CAGTCGGXCA 



7101 ATTTATCACG GTTATTC3TCT CATGAGCGCA TACATATTTG AATffTATTTX 
TAAR.TAGTCC CAATAACAGA GTACTCCCCT ATGTATAAAC tTTACATAAAT 
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7151 CJUJUUeTAAA CAAATAGGGG TTCCGCGCAC ATTTCCCCGX AAAGTCCCAC 
CTTTTTATTT GTTTATCCCC AMOCGCGTC TAAAOGGGCT TTTOICGGTO 



7251 CCTXTCACGA CGCCCTTTCG XC 
GCATAGTGCT CCGGGAAACC AG 
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